EDITORIAL


Patents, economics, and pandemics 


As coronavirus disease 2019 (COVID-19) has quickly
killed hundreds of thousands across an unpre- 
pared world, destroyed the livelihoods of millions, 
and cost trillions of dollars, nations must now ex- 
pand their mindsets and focus not only on over- 
coming the next phases but also on long-range 
strategies—for this and for future pandemics. The 
world needs a permanent preparedness enterprise to en- 
gage in a sustained effort to avert pandemics, and provide 
for the affordable and widespread administration of vac- 
cines and therapies when they are discovered. 
Governments, academia, and industry tend to forget 
the threat of infectious diseases between pandemics, but 
when they emerge, multibillion-dollar crash programs 
are quickly put into place. This boom-and-bust cycle in 
infectious diseases research and development (R&D) has 
limited attractiveness to scien- 
tists, thus preventing real prog- 
ress. COVID-19 has mobilized the 
scientific community to generate 
hundreds of drugs, diagnostics, 
and vaccines in various stages of 
development, but this enthusi- 
asm will quickly evaporate once 
the crisis is over and funding 
dries up. Existing governmental 
and market mechanisms fail to 
protect society against present 
and future public health threats, 
leading to chronic underinvest- 
ment in infectious diseases in the 
private sector and causing many 
companies to quietly abandon the field. Nonprofit orga- 
nizations try to fill the gap but are unlikely to meet public 
health needs. Without sustained government effort and 
investment, the world will be unprepared for pandemics. 
Another reason for the market failure in pandemic pro- 
tection is an inherent misalignment between economic 
incentives driving industry versus benefits to the public. 
Private innovation is driven by the patent system. Phar- 
maceutical companies, responsive to their shareholders, 
typically invest in therapies for conditions that will pre- 
dictably maximize profits during the life of a patent. Pat- 
ents give a time-limited exclusivity to the innovator who 
can then set premium pricing that maximizes the return 
on R&D investment. Such pricing can hinder wide dis- 
semination once vaccines or therapies are developed, of- 
ten leaving many patients unable to afford these products. 
Tension therefore exists between the need to generate af- 
fordable products that preserve human health and the 
need of innovators to be appropriately rewarded for their 
risk taking. This is more critical for vaccines as the indi-
vidual vaccinee is not the only beneficiary. Indirect posi- 
tive externalities accrue to the whole of society through 
faster herd immunity and more rapid economic recovery. 
In economic terms, this means that such products should 
be priced as close as possible to marginal costs, a proposi- 
tion normally unattractive to drug innovators. 

How is it possible to resolve this dilemma? A system 
of ex ante economic rewards should be created based on 
specific innovation and product development milestones. 
Governments should contribute not only emergency 
grant awards (which suffer from the need to pick win- 
ners and losers prospectively with scant evidence of effec- 
tiveness) but also sustained, predictable, and prospective 
achievement rewards or even prizes based on the value 
of explicitly achieved preclinical, clinical, manufactur- 
ing, and distribution milestones, 
including advanced market com- 
mitments when necessary to 
lessen the risk for the innovators 
who deliver. In exchange, prior to 
public funds being granted—and 
to avoid conflicts later—compa- 
nies that accept taxpayer support 
would agree to make pandemic 
countermeasures available to the 
public rapidly, including wide- 
spread production and dissemi- 
nation with reasonable margins. 
Today, industry practices such a 
system of milestone-based pay- 
ments with their smaller R&D 
partners. Why should governments not do the same with 
industry for pandemics? 

Similar proposals have been advanced in Europe, with 
the potential to license discoveries internationally. CO- 
VID-19 affects all of humanity, and the strategy should 
not be geographically restricted but global. Why not join 
forces? At the upcoming G7 and G20 meetings, govern- 
ments should create and fund an international organiza- 
tion akin to a global institute of health to manage staged 
funding through calls for proposals from all sectors. This 
would sustain the R&D needed to establish rapidly scal- 
able platforms and supply chains of diagnostics, drugs, 
antibodies, and vaccines against current and future pan- 
demic threats. In exchange, innovators would be fairly 
rewarded, and their discoveries made available at afford- 
able prices to people around the world. Creating such a 
sustainable R&D enterprise would be something positive 
to emerge from the wreckage of COVID-19. 

-Will Zerhouni, Gary J. Nabel*, Elias Zerhouni†


*Employee and shareholder of Sanofi (active in COVID-19 R&D). The views expressed here reflect my personal opinion and are independent of
policies/positions of this organization. †Board member and shareholder of Danaher and shareholder of Sanofi (both involved in COVID-19 R&D).
Respiratory scientist Calum Semple, to BBC, joining others on a scientific advisory panel who voiced
worries about easing U.K. lockdowns for COVID-19 this week before testing and case tracing are fully in place. 


As protesters demonstrated on 1 June in New York City’s Times Square and other U.S. cities against the killing of an unarmed black man by Minneapolis police, 
public health specialists worried that the large crowds, in which many people did not wear masks, would quicken the spread of COVID-19. 


Edited by Jeffrey Brainard 


Yemen faces COVID-19 disaster 


HUMANITARIAN AID | Yemen, already reel- 
ing from 5 years of civil war and a massive 
cholera outbreak, faces a catastrophic 
prognosis for deaths from COVID-19 
unless international donors contribute to a 
depleted assistance fund for public health, 
the United Nations says. Based on modeling 
by a group at Imperial College London, the 
World Health Organization is bracing for 
the novel coronavirus to infect about half of 
Yemen’s population of about 30 million and 
kill an estimated 30,000 to 40,000. Official 
case numbers remain low: As of 1 June,
the country had reported 358 cases and
85 deaths. But there are unofficial reports
of hundreds more cases; rival health minis- 
tries in Aden and Sana’a have accused each 
other of intentionally underreporting the 
extent of COVID-19 in the areas they con-
trol. Even after raising $1.35 billion from 
donors on 2 June, the United Nations is still 
short nearly $1 billion in funding needed to 
sustain humanitarian programs in Yemen 
for the rest of 2020. 


105,644 


U.S. deaths from COVID-19, 
as of 2 June. The total, the world’s 
highest, surpassed 100,000 on 
28 May. Worldwide, deaths 
have reached more than 377,000 
(Johns Hopkins University). 
Trump confirms WHO exit


POLICY | President Donald Trump on
29 May said he would follow through on his
threat to withdraw the United States from 
the World Health Organization (WHO), 
which he says has mishandled the corona- 
virus pandemic. He vowed to redistribute 
some of its allocated funding directly to 
groups it supports. Public health advocates 
blasted the decision, saying it could hamper 
a broad range of disease-fighting efforts, and 
noting it came just 11 days after Trump gave 
WHO 30 days to respond to a U.S. request for 
reforms. Legal experts say the United States 
is still obligated to pay WHO some $60 mil- 
lion in 2020 membership dues, and they note 
that an exit cannot be finalized for at least
1 year. If Trump loses the November election,
the next president could reverse the decision. 
China CDC head wants changes


PUBLIC HEALTH | The head of China’s 
national public health agency last week 
asked the government to give it more 
decision-making power and reduce 
“micromanagement” by health adminis- 
trators to address shortcomings revealed 
by the COVID-19 pandemic. Gao Fu (also 
known as George Gao), director of the 
Chinese Center for Disease Control and 
Prevention, offered a pointed critique of 
existing bureaucracies in an interview with 
China Youth Daily, an official government 
newspaper. During the crisis, the agency’s 
experts could make recommendations but 
were not included in key decisions about 
the response, Gao said. Responsibility for 
public health is splintered among govern- 
ment agencies that don’t coordinate their 
work, he added. 


The other guy’s COVID-19 


OPINION RESEARCH | Americans think 
they are less likely than peers to contract 
COVID-19, even though many worry
about the danger it poses to the broader
population. In surveys in March and April,
psychologists at University College London
found that participants rated the pandemic
as a large danger to others (74 on a scale
of 100, on average, with 100 representing
“extreme danger”) but not to themselves
(2.8 on a five-point scale, with five represent-
ing “much less likely” to get sick). Despite 
this apparent paradox, most respondents 
followed government advice on social dis- 
tancing and other precautions, according to 
a preprint posted on the PsyArXiv server on 
29 May. Those who were more worried about 
others were more likely to follow public 
health advice, the survey found, even though 
“optimism bias” about one’s own prospects 
often leads people to take undue risks. 


Panel faults U.K. test statistics 


EPIDEMIOLOGY | The head of a U.K. agency 
that oversees government statistics this 
week criticized Secretary of State for Health 
and Social Care Matt Hancock for releas- 
ing numbers on the volume of COVID-19 
testing that were inflated “at the expense of 
understanding.” The figures mix completed 
lab results with uncompleted tests in
which nose swabs were mailed for analysis,
wrote David Norgrove, chairman of the UK 
Statistics Authority, in a 2 June letter to 
Hancock. The practice yields an artificially 
low rate of positive results, Norgrove said. 


SCIENCEMAG.ORG/TAGS/CORONAVIRUS 
Read additional Science coverage of the pandemic. 
Congo tallies new Ebola cases


An apparently new Ebola outbreak has surfaced in the Democratic
Republic of the Congo (DRC), just as the country was close
to declaring the disease under control. On 1 June, the DRC’s
Ministry of Health said eight people in Mbandaka, a city in
Equateur province, had hemorrhagic fever and four had died.
Blood samples in three of the cases tested positive for the virus.
In 2018, the same region quelled an Ebola outbreak that lasted 
3 months. No evidence suggests this new outbreak is linked to another, 
ongoing one in the DRC’s North Kivu province, 1000 kilometers away, 
that began just as the 2018 Equateur outbreak ended. North Kivu has 
not recorded a new case for 3 weeks. Ebola virus regularly jumps from 
animals to humans, and this is the DRC’s 11th recorded outbreak. 


Iranian nuclear work scuttled 


NONPROLIFERATION | Ratcheting up 
its pressure campaign against Iran, the 
United States announced last week it 
will cancel waivers of economic and 
legal sanctions that had allowed foreign 
organizations to help Iran reconfig-
ure a nuclear reactor to eliminate a
proliferation risk. Originally designed 
for research and to produce isotopes, 
Iran’s Arak heavy water reactor could 
accumulate enough plutonium in spent 
fuel to produce one or two bombs a 
year. Under the multinational nuclear 
deal negotiated with Iran in 2015 by 
former U.S. President Barack Obama’s 
administration, Iran agreed to convert 
the reactor to run on low-enriched ura- 
nium fuel, which would produce only 
traces of plutonium. Two years ago, the 
United States announced it was with- 
drawing from the deal. Last week, U.S. 
Secretary of State Mike Pompeo cited 
“Iran’s continued nuclear escalation” as
the basis for ending the waivers, which 
also covered reducing proliferation risk 
at the Tehran Research Reactor. Iranian 
officials say they will continue to recon- 
figure Arak on their own. 


ITER builders take key step 


FUSION ENERGY | Construction crews
last week lifted into place the first major
piece of the $25 billion ITER project,
which will be the world’s largest fusion
reactor and may finally demonstrate
that melding together hydrogen nuclei
is a viable energy source. In last week’s
step, about 200 workers carefully lifted
the 1250-ton, 30-meter-wide steel base
of the reactor’s containment chamber
into the construction pit near Cadarache
in France. The COVID-19 pandemic has
slowed progress by delaying delivery
of some components; if key ones are
delayed past December 2021, it may
become difficult to complete construc-
tion by 2025 as planned, says ITER
Director-General Bernard Bigot.

Cranes lift the steel base of ITER’s containment
chamber, called a cryostat, into place.


U.S. limits Chinese visas 


POLICY | Chinese graduate students
with ties to institutions in China that do 
military research can no longer receive 
a visa to study in the United States 
under an executive order issued last 
week by President Donald Trump. Those 
already in the United States with such 
ties could have their immigration status 
reviewed. The new policy is part of an 
effort by the Trump administration and 
Republicans in Congress to block what 
they see as China’s widespread theft of 
U.S.-funded academic research. Higher 
education officials worry such steps— 
including another pending bill targeting 
Chinese financial support of U.S.-based 
researchers—will limit the pool of 
foreign talent. 
Men, like this one being
carried into a medical
center for COVID-19
patients in Moscow, tend
to get sicker from the
coronavirus than women.


Sex hormones signal why virus hits men harder 


Emerging role of androgens suggests potential therapies to suppress infection 


By Meredith Wadman 


In January, one of the first publications
on those sickened by the novel corona-
virus in Wuhan, China, reported that
three out of every four hospitalized pa-
tients were male. Data from around the
world have since confirmed that men
face a greater risk of severe illness and 
death from COVID-19 than women, and 
that children are largely spared. Now, sci- 
entists investigating how the virus does its 
deadly work have zeroed in on a possible 
reason: Androgens—male hormones such 
as testosterone—appear to boost the virus’ 
ability to get inside cells. 

A constellation of emerging data supports 
this idea, including COVID-19 outcomes in 
men with prostate cancer and lab studies 
of how androgens regulate key genes. And 
preliminary observations from Spain sug- 
gest that a disproportionate number of men 
with male pattern baldness—which is linked 
to a powerful androgen—end up in hospitals 
with COVID-19. Researchers are rushing to 
test already approved drugs that block an- 
drogens’ effects, deploying them early in 
infection in hopes of slowing the virus and 
buying time for the immune system to beat 
it back. 

“Everybody is chasing a link between an- 
drogens ... and the outcome of COVID-19,” 
says Howard Soule, executive vice president 
at the Prostate Cancer Foundation, who
on 13 May ran a Zoom call presenting the 
newest research that drew 600 scientists 
and physicians. A second call scheduled for 
3 June will discuss incipient clinical trials. 

Epidemiological data from around the 
world have confirmed the early reports of 
male vulnerability. In Lombardy in Italy, for 
example, men comprised 82% of 1591 pa- 
tients admitted to intensive care units (ICUs) 
from 20 February to 18 March, according to 
a JAMA paper. And male mortality exceeded 
that of women in every adult age group in 
another JAMA study of 5700 New York City 
patients hospitalized with COVID-19. 

A protective treatment?
In one Italian study, men with prostate cancer who
received drugs that suppress androgens were much
less likely to be infected with SARS-CoV-2.

                                  MEN ON ANDROGEN-             MEN NOT
                                  DEPRIVATION THERAPY (ADT)    ON ADT
Total men with prostate cancer    5273                         37,161
Number infected with SARS-CoV-2   4                            114
Estimated cases per 10,000        8                            31

Now, researchers are on the trail of a
mechanism for this male bias, an effort
led by prostate cancer researchers, who
have a deep acquaintance with androgens. 
Christina Jamieson of the University of 
California (UC), San Diego, who has devel- 
oped organoids to study prostate cancer, 
recalls that she was in a Zoom meeting 
honing ideas on how to link her research to 
COVID-19 when her sister, also a UC San 
Diego scientist, sent her a one-word text. It 
read: “TMPRSS2.” 

It was 16 April, and within minutes 
Jamieson had found the publication that 
prompted the text: a Cell paper by Markus 
Hoffmann of the Leibniz Institute for Pri- 
mate Research and colleagues. The paper 
sent a lightning bolt through the prostate 
research community, because it showed 
that infection with SARS-CoV-2, the virus 
that causes COVID-19, relies in part on TM- 
PRSS2, a membrane-bound enzyme. The 
enzyme cleaves the “spike” protein on the 
coronavirus’ surface, allowing the virus to 
fuse with the host cell’s membrane and get 
inside the cell. 

Jamieson and other prostate cancer re- 
searchers were familiar with the enzyme, 
because in about half of all prostate can- 
cers, a TMPRSS2 mutation revs up an onco- 
gene that kicks cell growth into overdrive. 
In the prostate, TMPRSS2 is produced when 
male hormones bind to the androgen recep- 
tor. “Doing research, it’s like you're trying 
to throw an anchor into the vast ocean of 
possibilities,” Jamieson says. The discovery
that TMPRSS2 helps the virus enter cells 
“felt like the anchor hit ground.” 

Researchers haven’t established that an- 
drogens control TMPRSS2 in the lung— 
ground zero for SARS-CoV-2 infection—as 
they do in the prostate; studies in lung tis- 
sue and cells from mice and humans come 
to conflicting conclusions. But after the Cell 
paper was published, Andrea Alimonti, head 
of molecular oncology at Università della
Svizzera italiana, strengthened the androgen 
link by looking at data on more than 42,000 
men with prostate cancer in Veneto in Italy. 
He and colleagues found that patients on 
androgen-deprivation therapy (ADT)—drugs 
that slash levels of testosterone—were only 
one-quarter as likely to contract COVID-19 
as men with prostate cancer not on ADT, 
they reported in the Annals of Oncology (see 
table, p. 1038). Men on ADT were also less 
likely to be hospitalized and to die, although 
numbers were small. 
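
The "one-quarter as likely" figure can be checked against the counts in the accompanying table. The following is a minimal sketch of that arithmetic, not the study's own analysis: the function name `cases_per_10k` is hypothetical, and the calculation is a crude rate ratio with no adjustment for age or other conditions.

```python
def cases_per_10k(infected: int, total: int) -> float:
    """Crude infection rate per 10,000 people (illustrative helper, not from the study)."""
    return infected / total * 10_000


# Counts taken from the table: men with prostate cancer in Veneto, Italy.
adt_rate = cases_per_10k(infected=4, total=5273)         # men on ADT
non_adt_rate = cases_per_10k(infected=114, total=37161)  # men not on ADT

# Ratio of the two crude rates; a value near 0.25 means "one-quarter as likely".
rate_ratio = adt_rate / non_adt_rate

print(f"On ADT:     {adt_rate:.1f} per 10,000")
print(f"Not on ADT: {non_adt_rate:.1f} per 10,000")
print(f"Rate ratio: {rate_ratio:.2f}")
```

Rounded to whole numbers, the two rates match the table's 8 and 31 per 10,000. Because only four men on ADT were infected, the ratio is sensitive to very small changes in the counts, which is consistent with the caution that numbers were small.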

Another retrospective study, still unpub-
lished, controlled for age and other medical
conditions and got similar results: Of 58 pa-
tients with prostate cancer who contracted
the coronavirus, the 22 taking ADT were sig-
nificantly less likely to be hospitalized and to
need supplemental oxygen, says William Oh,
a prostate cancer physician-scientist [...]
that androgen deprivation may limit
those severe outcomes,” Oh says.

Two small studies have reported that
men with male pattern baldness are over-
represented among hospitalized COVID-19
patients. This type of baldness is associ-
ated with high levels of dihydrotestosterone
(DHT), a key metabolite of testosterone,
in the scalp. An April study of 41 Span-
ish men hospitalized for COVID-19 found
that 71% had male pattern baldness; the
background rate in white men is esti-
mated at 31% to 53%. A second study
published last month found that 79% of
122 men in three Madrid hospitals with
COVID-19 had male pattern baldness.

More circumstantial evidence comes from
stem cell biologist Faranak Fattahi, of UC
San Francisco. Her team found a strong link
between a measure of active androgens in
the blood and the severity of COVID-19 dis-
ease in data from several hundred male pa-
tients in the UK Biobank; they did not find
this effect in women.

Such evidence is already inspiring pos-
sible therapies. Matthew Rettig, an onco-
logist who directs prostate cancer research
at UC Los Angeles, is leading a double-blind,
randomized, placebo-controlled trial of the
androgen-suppressing drug degarelix in
200 veterans hospitalized with COVID-19 in
Los Angeles, Seattle, and New York City. Pa-
tients in the active arm will receive a single
injection that virtually zeroes out testoster-
one levels within 3 days. That reduces ex-
pression of the TMPRSS2 gene, at least in the
prostate, to almost nil. Side effects include
hot flashes and breast growth and “are equiv-
alent to surgical castration,” Rettig says.

But whereas in prostate cancer, the in-
jections are given month after month,
“This study only involves a one-time dos-
age. It’s temporary,” Rettig says. He hopes
to learn in 4 to 5 months whether the treat-
ment helps keep patients off ventilators
and reduces mortality.

Several other antiandrogen trials are
in the offing in the United States and Eu-
rope. Prostate cancer researcher Catherine
Marshall of Johns Hopkins University is
preparing a trial of bicalutamide, an older,
inexpensive androgen receptor blocker, in
20 patients hospitalized within 3 days af-
ter they tested positive for COVID-19. Her
group will compare outcomes with patients
who don’t receive the drug. “We think that
if this works it’s going to work by decreas-
ing the viral load in patients,” Marshall says.
“That’s why we are doing it earlier [...]
estrogens have been shown to help
heal acute lung injury. Bicalu-
tamide raises estrogen levels as well as
suppressing androgen activity. Marshall
says of the emerging wave of trials: “All
these trial ideas have been team science at
its best and probably at its fastest.”

Adding to the promise of antiandrogens is 
lab-based evidence from Fattahi’s study. Her 
team screened Food and Drug Administra- 
tion (FDA)-approved drugs in heart cells in 
the lab to see which ones reduced levels of the 
essential SARS-CoV-2 receptor, angiotensin- 
converting enzyme 2 (ACE2). Key hits in- 
cluded finasteride and dutasteride, drugs 
that block the conversion of testosterone 
to DHT, according to a 15 May preprint. 
Finasteride is FDA-approved to treat male 
pattern baldness and dutasteride for pros- 
tate enlargement. Dutasteride also reduced 
ACE2 levels in healthy human lung alveo- 
lar cells. 

Although researchers pursuing the an- 
drogen link caution that their hypothesis 
remains just that until it is borne out in 
lab and clinical studies, they’re optimis- 
tic. “When all evidence points to the same 
thing it’s very satisfying,” Fattahi says. 
COVID-19


Blood vessel 
injury may spur 
disease’s fatal 
second phase 


Damage to vessel lining may 
drive mysterious clotting 
disorders, inflammation 


By Catherine Matacic 


Frank Ruschitzka told his pathologist to
be ready before the first COVID-19 pa-
tient died. In early March, Ruschitzka,
who leads the cardiology department at
University Hospital Zürich, noticed that
patients with the disease had strange
symptoms for what was then thought to be 
chiefly a respiratory infection. Many patients 
had acute kidney failure, organ damage, and 
mysterious blood clots. Several weeks later, 
the first body was autopsied: Tiny clots and 
dead cells littered the capillaries of the lungs, 
and inflammation had distended blood ves- 
sels supplying every organ in the body. 

The pathologist had never seen anything 
like it. But the results showed Ruschitzka 
why his patients were suffering so much: 
The virus had targeted their blood vessels. 

Since the Zürich team’s findings were
published in mid-April, dozens of studies 
have revealed similar patterns of vascular 
damage in people who died of COVID-19. 
For example, a 21 May paper in The New 
England Journal of Medicine showed that 
the lungs of COVID-19 victims had nine 
times as many clots as those who died of 
the H1N1 flu. Other studies have noted in- 
flammatory symptoms in children (Science, 
29 May, p. 923) and strokes in otherwise 
healthy young adults. Now, researchers 
have woven these findings into a new hy- 
pothesis explaining why some patients slip 
into a fatal “second phase” of COVID-19, 
1 week or so after hospitalization. 

The key is direct and indirect damage to 
the endothelial cells that line the blood ves- 
sels, particularly in the lungs, explains Peter 
Carmeliet, a vascular biologist at the Bel- 
gian research institute VIB and co-author of 
a 21 May paper in Nature Reviews Immuno- 
logy. By attacking those cells, COVID-19 
infection causes vessels to leak and blood 
to clot. Those changes in turn spark inflam- 
mation throughout the body and fuel the
acute respiratory distress syndrome (ARDS) 
responsible for most patient deaths. 

“It’s a vicious cycle,” says Nilam Mangal-
murti, a pulmonary intensivist at the Hos- 
pital of the University of Pennsylvania, who 
was not involved in the new research. 

This mechanism could explain why the 
disease pummels some patients who have 
obesity, diabetes, and cardiovascular condi- 
tions: The cells lining their blood vessels 
are already compromised. If so, drugs used 
to treat these conditions might help prevent 
other COVID-19 patients from sliding into 
serious disease. “[A vaccine] would be ter-
rific,” says Richard Becker, a cardiologist at
the University of Cincinnati College of Medi-
cine who outlined a similar
cardiovascular cascade in a
15 May review in the Journal
of Thrombosis and Throm-
bolysis. But until a safe, ef-
fective vaccine is available, 
he says, such therapeutics 
might be “a good start.” 

In healthy individuals, 
endothelial cells help regu- 
late blood pressure, prevent 
inflammation, and inhibit 
clotting, in part through 
the continual production 
of nitric oxide (NO); they 
also serve as gatekeepers 
for molecules passing in 
and out of the bloodstream. 
When injured, they send out 
a complex array of signals to 
immune cells and clotting 
factors, which rush to re- 
pair the site. And they warn 
their fellow endothelial cells 
to be on alert for invaders. 

Based on autopsy reports 
like those from the Zürich
hospital, the epidemiology 
of the disease, and how the 
new coronavirus behaves in 
cells in the lab, Carmeliet 
and colleagues believe the 
virus can send that system 
spinning out of control. 

When SARS-CoV-2 enters 
the lungs, it invades cells in 
the air sacs that transfer oxy- 
gen to the blood. Surround- 
ing those sacs are capillaries 
lined like bricks with endo- 
thelial cells. The virus di- 
rectly invades some of those 
cells; others become “acti-
vated,” likely in response to
signals from the invading vi- 
rus and other damaged cells. 
Some infected cells likely 
commit suicide. “It’s not a quiet death where
the cell just dies,” Mangalmurti says. “All the
contents leak out.” 

Carmeliet and colleagues suggest dam- 
age and other changes in the activated cells 
trigger vascular leakage, flooding the air 
sacs with fluid, a hallmark of ARDS. White 
blood cells swarm to the lungs and NO 
production likely plummets. Together with 
the activated endothelial cells, the immune 
cells release a host of signaling molecules, 
including interleukins, which raise local 
blood pressure and weaken cell junctions. 
Damage to the endothelial cells also ex- 
poses the membrane underneath them. 

That exposed membrane in turn triggers
uncontrolled clotting. The endothelial and
immune cells add fuel to the fire, recruit-
ing additional clotting factors and plate-
lets, which help form clots. Those clots
degrade into the key biomarker D-dimer,
creating the sky-high levels that alert cli-
nicians to patients in trouble (see graphic,
below). Eventually, such clotting spreads
throughout the body and blocks the blood
supply within vital organs.

A cascade of injury
A new hypothesis suggests SARS-CoV-2
attacks the endothelial cells that line the
blood vessels surrounding the lungs’ air
sacs, or alveoli. A spiral of damage can
result. Injured endothelial cells cause fluid
to leak out of vessels, trigger blood clotting,
and summon a host of immune cells and
messenger molecules that cause wide-

These chain reactions culminate in a 
final, destructive phase of inflammation. 
Like clotting, inflammation is an essential 
defense, sending a diverse army of cells 
and messenger molecules called cytokines 
to fight invaders and mop up the debris 
of battle. But in COVID-19, this reaction 
spirals out of control in a deadly cytokine 
storm and plunges patients’ 
bodies into shock. 

Ruschitzka says the 
three-step hypothesis 
“makes perfect sense” of 
what he saw in his patients; 
he’s already sending the 
Carmeliet paper to col- 
leagues. He says the array of 
pathways may also explain 
why some young people 
without known risk factors 
for COVID-19 become seri- 
ously ill: They might have 
undiagnosed clotting or au- 
toimmune disorders, such 
as rheumatoid arthritis, that 
amplify the effects of SARS- 
CoV-2 infection. 

This emerging view of 
the key role of endothelial 
cells suggests that a num- 
ber of existing drugs might 
dampen or even arrest the 
fatal second phase of the dis-
ease, Becker says. Already,
evidence that inflammation 
and clotting play a role in 
COVID-19 has inspired doz- 
ens of trials in the United 
States and Europe of anti- 
clotting, anti-inflammatory, 
and antiplatelet drugs. 

Ruschitzka thinks an- 
other commonly prescribed 
drug might help: statins. 
Typically taken to lower 
cholesterol, they also reduce 
inflammation and improve 
endothelial cell function. 

Mangalmurti welcomes 
such trials, but cautions 
that patients may respond 
differently depending on 
how healthy their endothe- 
lial cells are to start. “One 
size does not fit all.”
COVID-19


The pandemic’s first major 
research scandal erupts 


Critics question patient data used to challenge malaria drugs 


By Kelly Servick and Martin Enserink 


On its face, it was a major finding: Anti-
malarial drugs touted by the White
House as possible COVID-19 treat-
ments looked to be not just ineffective,
but downright deadly. A study pub-
lished on 22 May in The Lancet used
hospital records procured by a little-known 
data analytics company called Surgisphere to 
conclude that COVID-19 patients taking chlo- 
roquine or hydroxychloroquine were more 
likely to show an irregular heart rhythm—a 
known side effect thought to be rare—and 
more likely to die. Within days, large ran- 
domized trials of the drugs screeched to 
a halt. Solidarity, the World Health Orga- 
nization’s (WHO’s) megatrial of potential 
COVID-19 treatments, paused recruitment 
into its hydroxychloroquine arm. 

But just as quickly, the results have begun 
to unravel—and Surgisphere, which pro- 
vided patient data for two other high-profile 
COVID-19 papers, has come under withering 
online scrutiny from researchers and ama- 
teur sleuths. They have pointed out many 
red flags in the Lancet paper, including the 
astonishing number of patients and details 
about patient demographics and dosing that 
seemed implausible. “It began to stretch and 
stretch and stretch credulity,” says Nicholas 
White, a malaria researcher at Mahidol University
in Bangkok.

As Science went to press, The Lancet issued 
an Expression of Concern, noting “serious 
scientific questions” about its paper. Hours 
earlier, The New England Journal of Medi- 
cine (NEJM) issued an Expression of Concern 
about a second study using Surgisphere data, 
published on 1 May. The paper reported that 
taking certain blood pressure drugs including 
angiotensin-converting enzyme (ACE) inhibi- 
tors didn’t increase the risk of death among 
COVID-19 patients, as some researchers had 
suggested. The journal asked the authors “to 
provide evidence that the data are reliable.” 

A third study using Surgisphere data is 
also under fire. In an April preprint, Surgi- 
sphere founder and CEO Sapan Desai and co- 
authors concluded that ivermectin, an anti- 
parasitic drug, dramatically reduced mortal- 
ity in COVID-19 patients. In Latin America, 
where ivermectin is widely available, that 
study led some officials to authorize use of 
the drug, creating a surge in demand. 

Chicago-based Surgisphere has not pub- 
licly released data underlying the studies. 
On 2 June, Desai told Science through a 
spokesperson that he was “arranging a non- 
disclosure agreement that will provide the 
authors of the NEJM paper with the data ac- 
cess requested by NEJM.” And in a 29 May 
A hydroxychloroquine study is being audited.


statement, Surgisphere defended the integ- 
rity of its research and said it was pursu- 
ing “an independent academic audit” of its 
results in The Lancet. The journal and non- 
Surgisphere authors also said data reviews 
were underway. 

The episode has left leaders of halted hydroxychloroquine
trials weighing whether
to restart. “The problem is, we are left with 
all the damage that has been done,” says 
White, a co-investigator on a halted trial for 
COVID-19 prevention. It will now be hard to 
recruit people to key studies, he says. “The 
whole world thinks now that these drugs are 
poisonous.” 

Desai co-authored the Lancet paper with 
cardiologist Mandeep Mehra of Harvard 
University’s Brigham and Women’s Hospi- 
tal (BWH), cardiologist Frank Ruschitzka of 
University Hospital Zürich, and cardiac surgeon
Amit Patel, who listed affiliations with
the University of Utah and HCA Research 
Institute in Nashville, Tennessee. (Mehra and 
Patel referred inquiries to BWH. Ruschitzka 
did not respond to requests for comments.) 
The authors describe an analysis of electronic 
health record data from patients already 
treated for COVID-19 at 671 hospitals on six 
continents—nearly 15,000 people prescribed 
chloroquine or hydroxychloroquine, alone 
or in combination with an antibiotic, and a 
control group of 81,000 other patients. After 
adjusting for potentially confounding factors, 
the researchers found the risk of dying was 
9.3% for the control group versus 23.8% for 
those getting hydroxychloroquine alongside 
an antibiotic. 

In a 25 May media briefing, WHO Direc- 
tor-General Tedros Adhanom Ghebreyesus 
cited the results in announcing a “temporary 
pause” in Solidarity’s hydroxychloroquine 
arm. Regulators in France and the United 
Kingdom also instructed investigators, in- 
cluding White’s team, to halt enrollment in 
trials. And Sanofi said it would temporarily 
stop recruiting patients to two trials of its hy- 
droxychloroquine formulation. 

Other researchers immediately took issue 
with the analysis. The study does not properly 
control for the likelihood that patients get- 
ting the experimental drugs were sicker than 
the controls, says Matthew Semler, a critical 
care physician at Vanderbilt University. And 
White notes anomalies in the data. Although 
66% of the patients were reportedly treated in 
North America, the reported doses tended to 
be higher than the guidelines set by the U.S. 
Food and Drug Administration. And the authors
claim to have included 4402 patients in
Africa, but it seems unlikely that African hos- 
pitals would have detailed electronic health 
records for so many patients, White says. The 
study also reported more deaths in Australian
hospitals than the country’s official COVID-19 
death statistics, The Guardian reported. On 
29 May, The Lancet issued a correction say- 
ing a hospital assigned to the study’s “Aus- 
tralasia” group should have been assigned 
to Asia and updating a supplemental table. 
“There have been no changes to the findings 
of the paper,” it says.

The brief response left some researchers 
frustrated. “This was very, very annoying,” 
says James Watson, a statistician at Mahidol 
who on 28 May published an open letter—now 
signed by more than 140 researchers—that 
calls for the release of Surgisphere’s hospital- 
level data, an independent validation of the 
results, and publication of the peer-review 
comments that led to the Lancet publication. 
“The Lancet encourages scientific debate and 
will publish responses to the study, along 
with a response from the authors,” a journal 
spokesperson said in a response. 

On 2 June, many of the same researchers 
and others published an open letter to NEJM 
and the authors of the ACE inhibitor study, 
citing similar problems in that paper. It notes 
inconsistencies including a discrepancy be- 
tween the small number of hospitals in each 
country that are said to have shared patient 
data with Surgisphere and the high propor- 
tion of those countries’ confirmed COVID-19 
cases included in the study. 

Oddities also appear in the ivermectin 
study, says Carlos Chaccour of the Barcelona 
Institute for Global Health. There’s evidence 
that ivermectin, the key weapon in the global 
campaign against river blindness, also has 
antiviral properties. The 6 April preprint, 
co-authored by Patel, Desai, and Mehra, 
along with David Grainger of the University 
of Utah, used Surgisphere data reportedly 
collected at 169 hospitals around the world 
between 1 January and 1 March. It included 
three patients in Africa who received 
ivermectin—even though only two COVID-19 
cases had been reported in all of Africa by 
1 March, Chaccour and two colleagues note 
in a recent blog post. 

Chaccour says after he inquired about 
the discrepancy, the authors posted a sec- 
ond, longer version of the manuscript on 
19 April, containing data collected between 
1 January and 31 March. The new manu- 
script reported that ivermectin reduced the 
need for mechanical ventilation by 65% and 
slashed the death rate by 83%. But the re- 
vision had other problems, Chaccour and 
his colleagues wrote in their blog post. For 
example, the data shown in a figure were 
wildly different from those reported in the 
text. (Grainger also did not reply to a re- 
quest for a comment.) 

In response to the ivermectin study, the
Peruvian Ministry of Health modified its
COVID-19 treatment protocol to include
ivermectin (as well as hydroxychloroquine) 
for mild and severe cases of COVID-19; 
demand for the drug in Peru has surged. 
In Trinidad, Bolivia, the city government 
aimed to hand out more than 350,000 free 
doses of ivermectin after the country’s Min- 
istry of Health authorized its use against 
COVID-19. 

Surgisphere’s sparse online presence— 
the website doesn’t list partner hospitals by 
name or identify its scientific advisory board, 
for example—has prompted intense skepti- 
cism. Physician and entrepreneur James 
Todaro of the investment fund Blocktown 
Capital wondered in a blog post why Surgi- 
sphere’s enormous database doesn’t appear 
to have been used in peer-reviewed research 
studies until May. Chaccour asks how such a 
tiny company—LinkedIn lists only a hand- 
ful of employees—was able to reach data- 
sharing agreements with hundreds of hospi- 
tals around the world. 

Desai’s spokesperson says the company 
has 11 employees and has been developing its 
database since 2008. 

The potential of hydroxychloroquine for 
treating COVID-19 has become a political 
flashpoint. French microbiologist Didier 
Raoult, whose own widely criticized studies 
suggested a benefit from the drug, derided 
the Lancet study in a video posted on 2 June, 
calling the authors “incompetent.” 

For scientists running randomized trials 
of hydroxychloroquine, an urgent question 
has been how to respond to the paper and 
the ensuing flap. A trial funded by the U.S. 
National Heart, Lung, and Blood Institute 
opted to keep running after its data and 
safety monitoring board (DSMB) reviewed 
safety data from already enrolled partici- 
pants, says Semler, a co-investigator on 
the study. WHO’s paused Solidarity trial is 
awaiting similar review from its DSMB, says 
Soumya Swaminathan, the organization’s 
chief scientist. 

The controversy is an unfortunate distraction,
says Miguel Hernán, a Harvard epidemiologist
and co-investigator on an ongoing trial
of hydroxychloroquine in Spain and Latin 
America. “If you do something as inflamma- 
tory as this without a solid foundation, you 
are going to make a lot of people waste time 
trying to understand what is going on.” Chac- 
cour says both NEJM and The Lancet should 
have scrutinized the provenance of Surgi- 
sphere’s data more closely before publishing 
the studies. “Here we are in the middle of 
a pandemic with hundreds of thousands of 
deaths, and the two most prestigious medical 
journals have failed us,” he says. 


With reporting by Rodrigo Pérez Ortega, Charles Piller, 
and John Travis. 
Shuttered
natural history 
museums fight 
for survival 


But creative scientists vow 
recovery and move research 
and public programs online 


By Elizabeth Pennisi 


A few months ago, retirement was
the furthest thing from David
Thomas’s mind. “Then the world went
upside down,” recalls the archaeologist
from the American Museum
of Natural History in New York City.
In March, the coronavirus pandemic forced 
the museum to close its doors. No more 
school groups thronging the interactive ex- 
hibits, no more corporate dinners or lines 
of international tourists waiting to pay 
$23 a head to marvel at fossils. The mu- 
seum’s income plummeted 60%. 

Leaders first asked for early retirements. 
By early May, they had sliced the staff of 
1100 by 20% and furloughed an additional 
250 staff members. Many full-time employ- 
ees now work 3 days a week, mostly from 
home. Thomas opted to retire early, along 
with four of the other 38 curators. “It was 
the right thing to do,” he says. 

Around the world, natural history mu- 
seums are shuttered and reeling. Last 
week, the California Academy of Sciences 
announced it was furloughing or laying 
off 40% of its staff. “We will recover, but 
there is no doubt that we will be in some 
ways a different institution,” says Peter 
Roopnarine, a paleontologist there. 

Museums’ reliance on revenue from 
ticket sales and events makes them among 
the first scientific institutions to feel the 
economic impact of the COVID-19 pan- 
demic. “I worry about the long-term health 
of all natural history museums and the collections
that are in our sacred trust,” says
Shannon Hackett, an ornithologist at the 
Field Museum of Natural History in Chi- 
cago. “It will be very challenging for some 
museums to reopen at all,” adds Scott 
Cooper, who runs Drexel University’s Acad- 
emy of Natural Sciences in Philadelphia. 

But the crisis is also spurring museums 
to adopt or expand practices that, though
they may not restore lost revenue, are 
keeping the public engaged and research 
ticking along: an online biodiversity con- 
test, public discussions on Zoom, a web- 
cam streaming captive corals. Curators are 
also expanding and refining digital collec- 
tions that are accessible to both the public 
and homebound researchers. 

“We are seeing more changes in the mu- 
seum industry at this moment than we 
could push people to make previously,”
says Julie Stein, director of the Burke Mu- 
seum of Natural History and Culture at the 
University of Washington, Seattle, whose 
own institution has been devastated. The 
Burke Museum had opened a new build- 
ing in October 2019 and was “headed for 
record-breaking revenue,” Stein says—until 
the entire campus shut down on 6 March. 

Some museums, including the Smith- 
sonian Institution’s National Museum 
of Natural History in Washington, D.C., 
have dodged financial cliffs thanks to 
government support. The Natural History 
Museum, London, stayed afloat with emer- 
gency support from the U.K. government, 
but furloughed half its staff until the end 
of June. Similarly, the Field Museum has 
thus far avoided layoffs thanks to a cash re- 
serve and the federal paycheck protection 
program, says President and CEO Richard 
Lariviere. But 30% of the museum’s income 
comes from tickets and related activities, 
and $17 million is already lost. Given that 
cases of COVID-19 have yet to peak in Il- 
linois, Lariviere doubts the museum will 
open this summer, and worries he will be 
forced to make layoffs. 

Some university museums managed to 
avoid layoffs now, but may pay a price later 
if university budgets shrink. Harvard Uni- 
versity’s Peabody Museum of Archaeology 
and Ethnology will likely not reopen as
quickly as stand-alone museums, says Di- 
rector Jane Pickering. 

As they worry about the future, re- 
searchers are also distraught because 
they can’t pursue their current research. 
Travel restrictions have brought fieldwork 
to a screeching halt—and with it, the ad- 
dition of more specimens to collections. 
The American Museum of Natural His- 
tory alone has canceled 100 expeditions. 
And researchers can’t get into buildings 
to analyze existing collections. “We have 
been cut off from our collections, facilities, 
and colleagues,” says Anjali Goswami, a
paleobiologist at the Natural History Mu- 
seum, London. 

One trend accelerated by the crisis could 
help: efforts to digitize natural history col- 
lections. At Harvard’s Museum of Com- 
parative Zoology, staff working from home 
have been busy enhancing the millions of 
records in the museumwide database, for 
example adding latitude and longitude 
coordinates to specimens thus far identi- 
fied only by location names. “There’s a 
tremendous amount of data locked into 
collections,” says Kirk Johnson, director of 
the National Museum of Natural History, 
where detailed digital images of pressed 
plants in the herbarium allow research- 
ers to scrutinize them from afar. “We are 
now shining the light on the dark data 
of museums.” 

Rebecca Albright, a coral biologist at the 
California Academy of Sciences, is study- 
ing the mysteries of coral spawning; only 
one other research team has been able to 
get the corals to reproduce in a lab set- 
ting. Recently, Albright identified just the 
right conditions, including water tempera- 
ture and lighting that re-create changing 
day length and the cycling of the Moon, to 
The Field Museum of Natural History hosted a
socially distanced blood drive in its empty halls. 


prompt spawning. When she learned that 
she couldn’t be in the museum at the key 
time, she and her colleagues set up an in- 
frared webcam. “We never set up a camera 
before because we didn’t need to,” she says. 

The livestreaming camera allowed them 
to catch spawning in the act on 22 April, 
Earth Day—and made a big splash on the 
web. “If we had missed this, we would have 
had to wait a whole year,” Albright says.
The corals now have 1.6 million followers. 

Other scientists have refocused their re- 
search on the pandemic itself. Roopnarine 
previously studied how nature recovered 
from mass extinctions. Now, he is re- 
purposing his computer models of eco- 
system recovery to evaluate how various 
employment schemes may get economies 
back on track as lockdowns ease. “Our 
work has never been more relevant than it 
is now,” he says. 

Many see the pandemic as an opportu- 
nity for change. “I’m doing more public pro- 
grams than ever—in virtual formats,” says 
Sabrina Sholts, a biological anthropologist 
at the National Museum of Natural History. 

To engage the public in research, the 
Natural History Museum of Los Angeles 
County and the California Academy of 
Sciences in April enlisted thousands of 
citizen scientists in a global biodiversity 
effort called the City Nature Challenge. 
Participants gathered thousands of images 
of birds, insects, and other wildlife in more 
than 250 cities to help researchers study 
urban ecosystems. 

The pandemic itself is inspiring new di- 
rections. Leonard Krishtalka, director of 
the University of Kansas Biodiversity Insti- 
tute and Natural History Museum, wants 
museums to expand their focus to include 
microbes and viruses. 

At the National Museum of Natural His- 
tory, Sholts was already thinking along 
those lines when she pulled together a 
temporary exhibit inspired by the 2014 
Ebola epidemic. “Outbreak,” which ex- 
plores how pathogens spread between ani- 
mals and people, opened 2 years ago. After 
COVID-19 erupted, the exhibit was repli- 
cated online, with a digital version in half 
a dozen languages; a DIY kit has already 
been adapted in 41 countries and 30 U.S. 
states and territories. 

“We are in the process of reinventing 
what natural history museums are for,” 
Johnson says, speaking by phone to a re- 
porter as he walked past the darkened 
halls of the Outbreak exhibit. “Museums 
can play a much more impactful role than 
they have in the past 50 years.” 
European Commission
President Ursula von der Leyen 
has proposed a €1.85 trillion 
budget that bets on science. 


Europe bets R&D spending will 
bring jobs to battered economy 


Horizon Europe gets €13.5 billion to spend fast, spur growth 


By Nicholas Wallace 


The European Union wants a massive
dose of research spending to lift it out
of what could be the worst recession
in its history. Last week, as part of a
€1.85 trillion budget and pandemic
recovery proposal, the European Commission,
the EU executive arm, unveiled plans
to pump €94.4 billion into research over 
7 years, nearly €11 billion more than origi- 
nally planned for the program, called Ho- 
rizon Europe. But not everyone thinks the 
money is the best medicine. 

The Commission says R&D spending 
will drive productivity, employment, and 
competitiveness. Based on its own mod- 
els, it has estimated that Horizon 2020, the 
€80 billion predecessor to Horizon Europe, 
will ultimately create up to 179,000 jobs and 
add €400 billion to €600 billion to gross do- 
mestic product (GDP) by 2030. Likewise, the 
new infusion “will ensure that we come out 
stronger from this crisis,” said EU research 
commissioner Mariya Gabriel in a statement. 

But Julia Lane, an economist at New 
York University who studies the effects of 
R&D spending, says Europe is not tracking 
the data needed to justify its predictions 
and, in any case, its numbers aren’t cred- 
ible. If the return on R&D investment for 
GDP is that good, “why don’t they just shut 
the entire economy down and move it into 
that?” she asks. “Just on the face of it, it’s a 
ludicrous number.” 

The Horizon Europe budget still has to 
be negotiated with national governments. 
But the Commission wants €13.5 billion of
the research money to be spent fast, within 
4 years, on projects in four broad categories: 
health; climate, energy, and mobility; digital, 
industry, and space; and the new European 
Innovation Council (EIC). The Commission 
will define specific topics and call for grant 
proposals from researchers; it will also con- 
sider making equity investments in startups 
through EIC (Science, 10 April, p. 120). 

In the short term, research to curtail 
the pandemic will do the most to help the 
economy, says Thomas Estermann, head of 
funding policy at the European University As- 
sociation, a lobby group. “Each week of con- 
finement and lockdown has a tremendous 
economic impact,” he says. “The solution to 
the crisis will clearly come from research.” 
The money won’t be available until January 
2021, however, and only a tiny portion of the 
€13.5 billion is likely to fund COVID-19 treat- 
ments or vaccines. 

In the long term, the value of R&D spend- 
ing is plain to everyone, says Robert-Jan 
Smits, president of the Eindhoven University 
of Technology and former head of the Com- 
mission’s research policy department. “If I 
ask a person here in the streets, ‘What are the 
sources that bring us prosperity and wealth?’ 
he or she will say, ‘education and technology,’”
he says. “It’s not even rocket science.”

While at the Commission, Smits brought 
in economists to develop one of the models 
that predicts economic outcomes based on 
historical data. The model, called NEMESIS, 
“can calculate for each investment, each euro 
that you invest in science and innovation at 
EU level, what will be the impact—even on
jobs, even on growth,” he says. 

But without detailed accounting from re- 
search centers, it is impossible to make rea- 
sonable estimates of job creation, says Lane, 
who in a former role at the U.S. National Sci- 
ence Foundation helped develop metrics to 
track job creation from the Obama adminis- 
tration’s 2009 stimulus package, which put 
billions into research. She says setting up the 
infrastructure to do that in the United States 
took years. 

According to Lane, the U.S. data show 
some impact: For example, researchers are 
more likely to found successful startups, 
which contribute disproportionately to job 
creation and productivity. But more than 
10 years later it still isn’t clear what kinds of 
science funding work best, she says. Europe, 
meanwhile, has “wasted the last 10 years” by 
failing to set up similar procedures, she says. 

Smits agrees that tracking the immediate 
impact of research investment is “a bit com- 
plex.” But he argues that identifying longer 
term impacts in models is “doable,” because 
NEMESIS, unlike conventional models, fac- 
tors in R&D spending. “I find it just com- 
pletely bizarre that people still work in the 
finance ministries with models which are not 
giving any recognition to key technologies, 
which are changing our lives on a daily ba- 
sis’ Smits says. 

Hans-Olaf Henkel, a former member of the 
European Parliament and former president 
of the Leibniz Association, an association of 
German research institutes, objects to the 
Commission’s top-down approach to stimu- 
lating the economy. “Any kind of centralized 
Brussels program, in my view, is doomed
to fail,” he says. He argues politicians don’t
know what kind of science is most likely to 
succeed, and should therefore concentrate on 
bottom-up funding. “They should let the sci- 
entists do their creative work and not come 
up with political goals.” The €13 billion Euro- 
pean Research Council, one of the few parts 
of Horizon 2020 that focuses on bottom-up 
funding and is meant to be immune from 
political interference, was originally slated 
to get a €3.6 billion rise in Horizon Europe, 
but the reallocation of money under the new 
budget puts that figure in doubt. 

Regardless of how the new budget shakes 
out, Andrés Rodríguez-Pose, an economic geographer
at the London School of Economics,
points out that EU grants are “a drop in the 
sea.” Most R&D funding in Europe comes 
from the private sector, and most of the rest 
comes from national governments, not the 
European Union. “The capacity of the EU in- 
stitutions to shape the way research is done 
in Europe is limited,” he says.


Nicholas Wallace is a journalist in Brussels. 
U.S. SCIENCE POLICY


Bill would supersize NSF’s budget—and role 


Legislation calls for $100 billion increase, new technology directorate, and new name 


By Jeffrey Mervis 


The National Science Foundation (NSF)
would get a huge infusion of cash, as
well as a new name and new responsibilities
for keeping the United States
on top in technological innovation,
under bipartisan bills introduced late
last month in both chambers of Congress. 
But some science policy veterans are ques- 
tioning whether a basic research agency 
should also be expected to spearhead the de- 
velopment of new technologies. 

The Endless Frontier Act (S. 3832 and H.R. 
6978) would create a technology directorate 
at NSF with a budget that could grow to 
$35 billion by 2024—more than four times 
the agency’s existing $8 billion budget. That 
would bring NSF to rough parity with the 
National Institutes of Health, whose $41 bil- 
lion budget makes it by far the government’s 
biggest funder of basic research. 

The bill would authorize spending 
$110 billion over 5 years, with $100 billion 
for NSF (see chart, right) and $10 billion 
for the Department of Commerce to set up 
a dozen or so regional technology hubs to 
promote innovation in areas not currently 
tech hot spots. NSF would also get a second 
deputy director to oversee all technology 
activities and a new name: the National 
Science and Technology Foundation. 

The funding levels are aspirational; Con- 
gress would still have to appropriate the 
money even if the bill were adopted. But ac- 
ademic leaders view the legislation as a huge 
vote of confidence in NSF, which turned 
70 this year. “These funds—which comple- 
ment, not supplant, existing resources, an 
important condition—build on the NSF’s 
strengths and would fill gaps in our research 
enterprise,” says Rafael Reif, president of
the Massachusetts Institute of Technology. 

The legislation addresses the chronic 
underfunding of NSF, says Neal Lane, 
who spent 5 years as NSF director before 
becoming science adviser to former Presi- 
dent Bill Clinton. “NSF gets enough good 
ideas to justify” a much larger budget, he 
says. “This bill makes clear that it’s time 
for such bold action.” 

But Arden Bement, who led NSF under 
former President George W. Bush, says 
other federal agencies already have the 
mission of supporting applied technology 
and development. And no federal agency, 
he adds, will ever have enough resources
to substitute for what industry spends on 
commercializing innovation. 

The bill’s name invokes the title of the 
seminal 1945 report by presidential science 
adviser Vannevar Bush that made the case 
for federal support of academic research 
and led to NSF’s creation in 1950. But the 
Stay Ahead of China Act might be a more 
accurate moniker, based on what lawmak- 
ers say they hope it will accomplish. 

“China and others are stealing American 
intellectual property and aggressively in- 


Dividing up the tech dollars
Lawmakers have proposed spending $110 billion
over 5 years to boost U.S. innovation. The Commerce
Department would receive $10 billion to establish
up to 15 regional technology hubs with industry
partners, whereas the National Science Foundation
(NSF) would get $100 billion, distributed across
these six categories:

University-based technology centers, 35%: For basic research, prototyping, and support of regional hubs
NSF priorities, 20%: Allocated as needed within NSF and to other U.S. agencies
Education and training, 15%: For scholarships, fellowships, and traineeships
Research, including social and ethical concerns, 15%: Money channeled through various NSF programs
Testbeds and fabrication, 10%: Support for facilities and processes to shorten time to market
Fostering entrepreneurship, 5%: Strengthening the innovation enterprise in academia


vesting in research and commercialization 
to dominate the known technology fields of 
the future,” the four co-sponsors—Senators 
Charles Schumer (D-NY) and Todd Young 
(R-IN) and Representatives Ro Khanna 
(D-CA) and Mike Gallagher (R-WI)—say in 
the preface to their bills. “The country that 
wins the race in key technologies—such as 
artificial intelligence, quantum computing, 
advanced communications, and advanced 
manufacturing—will be the superpower of 
the future,” they add. 

The legislation calls on NSF to fund an 
unspecified number of university-based 


Published by AAAS 


technology centers in those and other 
fields. The centers would be an order of 
magnitude bigger than NSF’s existing en- 
gineering and science centers, which have 
annual budgets of up to $5 million. Part- 
nering with industry scientists, the centers 
would both carry out fundamental re- 
search and develop prototypes of high-tech 
products and processes. NSF would also 
receive billions to expand its six science di- 
rectorates, boost investment in education 
and training, and set up facilities to test 
new technologies. 

The legislation could significantly alter 
how NSF operates, with the Defense Ad- 
vanced Research Projects Agency (DARPA) 
as a model. It allows NSF to adopt DARPA 
practices, including fixed-term appoint- 
ments of experts from the private sector 
and a focus on tangible, deadline-driven 
results. “The new [technology] director- 
ate can run like DARPA if NSF wants it to,” 
says one university lobbyist familiar with 
Schumer’s thinking. 

Because NSF currently gives grantees 
freedom to pursue curiosity-driven research, 
adopting a DARPA-like model “would be a 
huge cultural shift,” says Joel Parriott, direc- 
tor of public policy for the American Astro- 
nomical Society and a former White House 
budget official whose portfolio included 
NSF. “It’s also not clear how the technol- 
ogy directorate could operate so differently 
from the rest of the agency.” 

The bill’s prospects this year are iffy in 
a Congress consumed by the coronavirus 
pandemic and with scant time to legislate 
before the fall elections. One option is at- 
taching it to a reauthorization of defense 
programs that is seen as must-pass legisla- 
tion. But in the meantime, David Hart, a 
science policy expert at George Mason Uni- 
versity, says its “symbolic value” shouldn’t 
be ignored. 

“It’s a bipartisan statement that the 
country is underinvesting in key tech- 
nologies,” Hart says. “I’m not sold on do- 
ing all of this at NSF. But we, as a nation, 
have to come up with new ways to fund 
technology. This is certainly a breath- 
taking proposal.” 

Bement disagrees. “Action on this bill 
should be tabled for another day,” he says. 
Instead, he suggests Congress first “deter- 
mine whether the system is broken and, if 
so, in what ways.” 
DOUBLE TROUBLE


The brightest supernovae, used as cosmic yardsticks, have an 
unexpected trigger: pairs of dead stars 


The white dwarf stars were zipping
across the Milky Way at
more than 1000 kilometers per 
second—thousands of times faster 
than a speeding bullet, so fast that 
they would eventually escape the 
gravitational clutches of the Gal- 
axy. “They were not like anything 
we had seen before,” says Boris
Gaensicke, an astronomer at the University 
of Warwick. 

Gaensicke and his colleagues suspected 
these burnt-out embers were fleeing 
scenes of violence: supernova explo- 
sions in which another white dwarf 
had detonated like an Earth-size hy- 
drogen bomb. In the standard pic- 
ture of these explosions, known as 
type Ia supernovae, a nearby giant 
star lights the fuse. But the extreme 
speed of the white dwarfs suggested 
a different scenario, in which the flee- 
ing dwarfs had delivered the sparks, 
from close orbits around the doomed 
stars. When they blew up, these part- 
ners were flung away like shots from a 
biblical sling. 

The speeds of the three white dwarfs, 
discovered in a 2018 data set from Eu- 
rope’s Gaia satellite, were just one clue 
to this picture. Subsequent ground- 
based observations found traces of 
iron and other metals in the stars’ light—ele- 
ments that might have been implanted by a 
supernova blast. From the color and bright- 
ness of the light, the astronomers could de- 
duce that the stars were hotter and larger 
than typical white dwarfs, as if they had been 
puffed up by a sudden energy boost. Most 
telling, the researchers rewound the stars’ 
trajectories and found that one hailed from a 
known site: the remnant of a 90,000-year-old 
supernova. They are “the very best evidence” 
for the twin white dwarf scenario, says team 
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By Daniel Clery 


leader Ken Shen of the University of Califor- 
nia (UC), Berkeley. 

The evidence that twin white dwarfs drive most, if not all, type Ia supernovae, which account for about 20% of the supernova blasts in the Milky Way, “is more and more overwhelming,” says Dan Maoz, director of Tel Aviv University’s Wise Observatory, which tracks fast-changing phenomena such as supernovae. He says the classic scenario of a white dwarf paired with a large star such as a red giant “doesn’t happen in nature, or quite rarely.”

Kepler’s supernova, a type Ia explosion from 1604, lingers today with the x-ray glow of hot, leftover debris.

Which picture prevails has impacts 
across astronomy: Type Ia supernovae play 
a vital role in cosmic chemical manufac- 
turing, forging in their fireballs most of 
the iron and other metals that pervade 
the universe. The explosions also serve as 
“standard candles,” assumed to shine with 
a predictable brightness. Their brightness 
as seen from Earth provides a cosmic yard- 
stick, used among other things to discover 
“dark energy,” the unknown force that is
accelerating the expansion of the universe.
If type Ia supernovae originate as paired 
white dwarfs, their brightness might not 
be as consistent as was thought—and they 
might be less reliable as standard candles. 

Robert Kirshner, a longtime supernova 
watcher at the Gordon and Betty Moore 
Foundation, isn’t ready to give up on the 
classic scenario, but he acknowledges the 
misgivings about it. “It’s plausible that there 
is more than one path to a type Ia super- 
nova, but the uniformity of the light out- 
put is a tiny bit of a paradox,” he says. 
Nevertheless, “There’s a nagging 
doubt: Do we fully understand the na- 
ture of these explosions?” 


SUPERNOVAE COME in two basic fla- 
vors. The most common are called 
core-collapse. They occur when a mas- 
sive star, far larger than the Sun, runs 
out of fuel. The pressure of the core’s 
heat can no longer counteract grav- 
ity; the outer parts of the star collapse 
into the core with such force that a 
rebounding shock wave, or sometimes 
a renewed fusion burn, blows out the 
star’s outer layers, leaving a neutron 
star or black hole behind. 

The rest are all type Ia—white 
dwarf stars somehow reignited into 
a runaway fusion reaction. The burst
of new energy happens too fast for the
star to absorb, blowing the entire thing to 
smithereens in a blast that is brighter and 
longer than a core-collapse supernova. 

White dwarfs might seem unlikely candidates for fireworks. They are the cinders of Sun-like stars that have burned up their hydrogen and helium fuels, leaving (in most cases) carbon and oxygen, heavier elements that can’t fuse in such a low-mass star. White dwarfs shrink to the size of Earth, and only glow from leftover heat. Theoretically, they should cool to black over billions of years.

But if the white dwarf orbits in a binary pair with another star, a more spectacular fate may await it. The classic type Ia scenario was proposed in 1973 by John Whelan and Icko Iben. They mapped out the fading light of supernova SN 1972e for one full year, and realized its brightness could be explained by the decay of about one solar mass worth of radioactive metals. These, they proposed, were forged in a white dwarf that had grown to a size at which the pressure and temperature in its core would be high enough to fuse carbon, causing a thermonuclear blast. Whelan and Iben suggested the white dwarf might grow to that mass threshold if its gravity stole hydrogen gas from a companion star such as a puffed-up red giant, which doesn’t clutch its outer layers too tightly.

Later modeling showed achieving this growth was a tricky balancing act. If a white dwarf gobbles up hydrogen too fast, the hydrogen layer that forms on its surface can get hot enough to blow up prematurely, in a more modest thermonuclear explosion called a classical nova or, if it happens repeatedly, a recurrent nova. If it accumulates too slowly, the white dwarf can tiptoe up to a mass 1.44 times that of the Sun. At that threshold, known as the Chandrasekhar limit, theorists predict the pressure inside will cause electrons and protons to fuse into neutrons, and the white dwarf will quietly collapse into a neutron star.

Nuclear options

Type Ia supernovae, critical as cosmic rulers and chemical forges, are explosions of white dwarfs. But why would a burned-out star go nuclear? For decades, researchers thought a giant companion star donated gas to the white dwarf until it was heavy enough to ignite. Two alternative theories have emerged, both involving white dwarf pairs in spiraling dances to the death.

Stellar cinder
When a Sun-size star runs out of hydrogen and helium fuel, it collapses to a white dwarf, an Earth-size ember made mostly of carbon and oxygen. Left alone, it could smolder for billions of years. But a more explosive fate may await if it is paired with another star.

Classic scenario
The white dwarf steals gas, gaining enough weight to start carbon fusion, leading to a runaway explosion.
Steps (not to scale): Hydrogen from the red giant builds up in a layer → Core ignites; red giant survives → Supernova. Mass = 1.4 Suns

White dwarf merger
Merging white dwarfs generate heat to spark carbon fusion. But the explosions may be superficial and lopsided.
Steps: White dwarf binary; orbits shrink as stars emit gravitational waves → White dwarfs merge; heat of merger sparks fusion → Supernova. Mass ≥ 1.4 Suns

Double detonation
Just before a merger, one dwarf steals a thin layer of helium that detonates, igniting a bigger explosion in the core.
Steps: White dwarfs on point of merger; helium is pulled across → Helium layer explodes; shock waves ignite core → Second dwarf thrown from blast → Supernova. Mass < 1.4 Suns

But if the hydrogen is added at just the right rate, a white dwarf, especially one rich in carbon and oxygen, can respond more dramatically. Just short of the Chandrasekhar limit, at about 1.4 solar masses, the density and temperature of the core shoot up. Flares of carbon fusion break out. After smoldering for 100 or more years, a runaway reaction detonates the star and blows it apart in a matter of seconds. The resulting fireball, 5 billion times as bright as the Sun, forges a suite of metals from chromium to nickel in the periodic table. The radioactive decay of nickel to cobalt and then iron powers a brilliant afterglow that peaks in a couple of weeks and fades over years. And because, in this picture, every type Ia explodes with the same mass, they should all have the same peak brightness.

Theorists have struggled to simulate classic type Ia explosions. This is one of their successful models.


THAT SCENARIO satisfied astronomers for decades. But they have yet to find definitive evidence for it. After a supernova disperses enough to be transparent, astronomers routinely search for a surviving red giant companion but have never found one.

Models also suggest a flash of blue light should appear, hours after the supernova begins, as the expanding fireball slams into the tenuous hydrogen atmosphere of the red giant and heats it enough to glow in the ultraviolet. But when astronomers spotted a supernova in the nearby Pinwheel galaxy within hours of its ignition in 2011, they saw no blue flash. “SN 2011fe was a paradigm changer,” says Stan Woosley, a supernova expert at UC Santa Cruz.

A second predicted signal, weaker but more persistent, has also been elusive. Astronomers would expect some hydrogen from the red giant to be swept along with the other explosion debris, creating an absorption line in the spectrum of the supernova’s light as the remnant cools. But a 2019 study of 227 type Ia supernova remnants found no hint of this hydrogen.

The numbers are against the classic sce- 
nario, Maoz says. In a galaxy like the Milky 
Way, a type Ia supernova occurs once every
few hundred years. If they all originate
from a white dwarf and a red giant, the
galaxy would have to host some 10,000 of 
these pairs—and there’s little evidence of 
them. Because it’s hard to see binary pairs 
as separate stars at such great distances, 
astronomers have looked for indirect evi- 
dence, such as the recurrent novae trig- 
gered when hydrogen from a red giant 
spills quickly onto a white dwarf compan- 
ion, or the soft x-ray glow that can result 
from a steady stream of hydrogen. If the 
Milky Way harbored 10,000 
white dwarf-red giant pairs, 
we should see many novae and 
soft x-ray sources, but only a 
handful have been found, Maoz 
says. “We're missing 99.9% of 
the systems.” 
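
Maoz's bookkeeping can be checked with a line of arithmetic. The short Python sketch below uses only the two figures quoted above (10,000 required pairs and a 99.9% shortfall); the function and constant names are illustrative, not taken from any study.

```python
def implied_detected(expected_pairs: int, missing_fraction: float) -> float:
    """Return how many systems have been seen, given the expected count
    and the fraction said to be missing."""
    return expected_pairs * (1.0 - missing_fraction)


EXPECTED_PAIRS = 10_000    # white dwarf-red giant pairs the classic scenario needs (from the text)
MISSING_FRACTION = 0.999   # "We're missing 99.9% of the systems" (from the text)

found = implied_detected(EXPECTED_PAIRS, MISSING_FRACTION)
print(f"Implied number of known candidate systems: about {found:.0f}")
```

On these figures, the quote implies that only about 10 candidate systems are known, which matches the "handful" Maoz describes.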

The theoretical underpin- 
nings of the classic scenario 
have problems, too. A key as- 
sumption is that the explosion 
is so violent that the entire star 
is blown apart. But theorists 
have had a hard time model- 
ing how the smoldering fusion 
flares would burgeon into an 
all-consuming detonation. Mod- 
eling that process requires sim- 
ulating nuclear reactions with 
centimeter resolution across 
an object the size of Earth for 
100 years or more. Until recently, 
computers couldn’t handle that 
challenge—and now that they 
can, Shen says, only some mod- 
els predict the transition. “The 
state of the field is not conclu- 
sive yet.” 


ENTER ANOTHER WAY to blow up a 
white dwarf. Astronomers know 
that two white dwarfs orbiting 
each other—a so-called double 
degenerate—can gradually spi- 
ral inward as their rapid orbits 
throw out gravitational waves, 
which carry away energy. When 
they merge, there would be eas- 
ily enough heat and pressure to 
kick-start carbon fusion in the 
combined stars. 

Models suggest a problem with this mech- 
anism: The burning would not start in the 
core. It might kick off in the hot zone where 
the two are actively colliding, leading to a lop- 
sided and incomplete explosion that would 
leave much of the stars’ mass unburned. Or 
it could ignite close to the surface, only blow- 
ing off the outer layers of the merged star. 
“Getting a detonation in a double degenerate 
is by no means trivial,” Woosley says.

Yet Maoz says the odds are irresist- 
ible: White dwarf binaries are thought to 
be fairly common. In a 2018 study, Maoz
and his colleagues looked for the wobble 
of white dwarfs being tugged by a partner, 
combining survey results from the Sloan 
Digital Sky Survey and Europe’s Very Large 
Telescope. They concluded that about half 
a billion white dwarf binaries have merged 
in the Milky Way since its formation. If just 
one-sixth of those mergers led to a type Ia 
supernova, that would be one supernova 
every 200 years—roughly what is observed 
in the Milky Way. Another team, using Gaia 
data, came to a similar conclusion. 
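
The rate argument can be reproduced as a back-of-envelope calculation. The Python sketch below takes the merger count and the one-sixth fraction from the text; the age of the Milky Way is not given in the article, so the value used here is a rough assumption, as is the idea that mergers are spread evenly over that time.

```python
MERGERS_SINCE_FORMATION = 5e8  # "about half a billion" mergers (from the text)
SUPERNOVA_FRACTION = 1 / 6     # share of mergers assumed to explode (from the text)
GALAXY_AGE_YEARS = 13e9        # ASSUMPTION: rough age of the Milky Way, not stated in the text


def years_between_supernovae(mergers: float, fraction: float, age_years: float) -> float:
    """Average interval between supernovae, assuming a constant merger rate."""
    supernovae = mergers * fraction
    return age_years / supernovae


interval = years_between_supernovae(
    MERGERS_SINCE_FORMATION, SUPERNOVA_FRACTION, GALAXY_AGE_YEARS
)
print(f"About one type Ia supernova every {interval:.0f} years")
```

With these inputs the interval is a little over 150 years, the same order as the roughly 200 years quoted; the exact figure depends on the assumed age and merger history.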

Maoz is undeterred by the problem of 
getting a complete, symmetrical blast. 
“Just because we don’t know how it hap- 
pens doesn’t mean nature hasn’t found 
a way.” In fact, many believe nature has 
found a way to blow up one member of a 
white dwarf pair—but without a merger. 
White dwarfs can have some leftover he- 
lium in their atmospheres after the core 
stops burning. When an orbiting pair is 
on the cusp of merging, the larger of the 
two stars can rapidly steal helium from the 
smaller one to form a dense helium layer 
on its surface. The helium layer can act as 
a kind of blasting cap, exploding in a small 
thermonuclear blast and sending a shock 
wave into the star that can ignite the core. 

This scenario is called D6, for dynami- 
cally driven double-degenerate double 
detonation. The idea was first developed 
in 2010 by James Guillochon, a researcher 
at the Harvard-Smithsonian Center for 
Astrophysics, and his colleagues. It leaves 
the smaller white dwarf battered but 
thrown free, like those Shen’s group found 
in the Gaia data. D6 was originally thought 
to require a hefty amount of helium, mak- 
ing it a rare event, but more recent model- 
ing suggests just a few percent of a solar 
mass could be enough, Woosley says. 


ONE KEY FEATURE of the D6 scenario is that 
the exploding white dwarf can be well be- 
low the critical mass, because the spark 
comes from a shock wave and not from 
gravitational pressure. A less massive ex- 
ploding star will produce less nickel and 
be less bright. 

Recent studies of the metallic elements 
supernovae forge suggest the low-mass 
type Ia may be the norm. According to 
models, the production of manganese in 
type Ia supernovae is particularly sensi- 
tive to density in the white dwarf’s core: 
If the star is close to the 1.4-solar mass 
threshold, its high-density core produces 
lots of manganese; if the star is lighter—as 
is likely in the D6 mechanism—it produces 
one-tenth as much. 

As a result, manganese abundances de- 
rived from the light of stars today can hint 
at the masses of the ancient supernovae
that seeded them with heavier elements.
“Manganese provides an indirect way
to probe previous generations of type Ia
supernovae that went off in that galaxy,” says 
Ashley Ruiter of the University of New South 
Wales, Canberra. 

In a pioneering study from 2013, re- 
searchers led by Ivo Seitenzahl, then at the 
Julius Maximilian University of Würzburg,
compared manganese abundance in the 
Sun with models of how much manganese 
would be produced by supernovae of dif- 
ferent masses. They found that only half of 
the supernovae that exploded in the solar 
neighborhood in the past needed to be high 
mass to explain the Sun’s manganese con- 
tent. “This was the first of a new wave of 
results,” says Maria Bergemann of the Max 
Planck Institute for Astronomy. This year, 
she and her colleagues reported looking 
at manganese in 42 stars across the Milky 
Way and concluded that the abundances 
suggest 75% of the galaxy’s type Ia super- 
novae were low mass. 
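
To see why manganese is a useful tracer, consider a toy mixing calculation. The Python sketch below assumes a simple linear blend of two supernova populations, with the low-mass yield set to one-tenth of the high-mass yield as stated above. It is only an illustration of the logic, not the chemical evolution modeling used in the studies, and all names are hypothetical.

```python
HIGH_MASS_YIELD = 1.0  # relative manganese yield near 1.4 solar masses (normalized)
LOW_MASS_YIELD = 0.1   # "one-tenth as much" for lighter stars (from the text)


def mean_manganese_yield(low_mass_fraction: float) -> float:
    """Average relative yield per supernova for a given share of low-mass events.

    ASSUMPTION: yields from the two populations simply add in proportion.
    """
    high_mass_fraction = 1.0 - low_mass_fraction
    return low_mass_fraction * LOW_MASS_YIELD + high_mass_fraction * HIGH_MASS_YIELD


for share in (0.0, 0.5, 0.75):
    print(f"{share:.0%} low mass -> relative yield {mean_manganese_yield(share):.3f}")
```

In this toy picture, moving from no low-mass explosions to 75% low-mass cuts the average manganese yield to about a third, a difference large enough to show up in stellar abundances.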




The implications of undersize type Ia su- 
pernovae extend far beyond the elements 
in the present-day universe. They also 
raise questions about the explosions’ long- 
standing role as “standard candles” for 
probing cosmic history. 

In 1998, researchers compared a few 
dozen distant type Ia supernovae with 
closer ones and found that they were dim- 
mer than they should have been. They 
concluded that the universe’s expansion 
is accelerating, driven by some unknown 
dark energy—a discovery for which they 
were awarded the 2011 Nobel Prize in 
Physics. Supernova distances are also at 
the heart of a dispute over the value of the 
expansion rate itself, known as the Hubble 
constant (Science, 10 March 2017, p. 1010). 
In the nearby universe the expansion rate 
is measured using standard candles such 
as type Ia supernovae; in the distant, early 
universe, it is derived from clues such as 
the cosmic microwave background, the 
echo of the big bang. When the effect of 
dark energy is taken out, the two values 
should agree—but they don’t. 

Could a not-so-standard candle jeopardize those discoveries? “It means something, but not that dark energy goes away,” Woosley says. Dark energy has been confirmed using other methods, so he’s not worried about that. But he thinks cosmologists will run into trouble as they put their theories to more rigorous tests that require more precise standard candles. “Supernovae could be less useful for precision cosmology,” he says.

Astronomers already knew the peak 
brightness of type Ia supernovae isn’t per- 
fectly consistent. To cope, they have worked 
out an empirical formula, known as the Phil- 
lips relation, that links peak brightness to 
the rate at which the light fades: Flashes that 
decay slowly are overall brighter than those 
that fade quickly. But more than 30% of type 
Ia supernovae stray far from the Phillips re- 
lation. Perhaps low-mass D6 explosions can 
explain these oddballs, Shen says. For now, 
those who wield the cosmic yardstick will 
need to “throw away anything that looks 
weird,” Gaensicke says, and hope for the best. 
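
The Phillips relation is usually written as a straight line in the decline rate, here taken as the drop in blue-band magnitude over the 15 days after peak. The Python sketch below shows how such a relation turns a light curve into a distance. The two coefficients are illustrative assumptions of roughly the size found in published calibrations, not values from this article, and the function names are hypothetical.

```python
ZERO_POINT = -19.26  # ASSUMPTION: peak absolute magnitude at a decline of 1.1 mag (illustrative)
SLOPE = 0.78         # ASSUMPTION: change in peak magnitude per magnitude of decline (illustrative)


def peak_absolute_magnitude(decline_15_days: float) -> float:
    """Linear Phillips-type relation: faster decliners are intrinsically fainter.

    Smaller (more negative) magnitudes mean brighter objects.
    """
    return ZERO_POINT + SLOPE * (decline_15_days - 1.1)


def distance_parsecs(apparent_peak_magnitude: float, decline_15_days: float) -> float:
    """Distance from the standard distance-modulus formula m - M = 5 log10(d / 10 pc)."""
    absolute = peak_absolute_magnitude(decline_15_days)
    return 10 ** ((apparent_peak_magnitude - absolute + 5) / 5)


slow = peak_absolute_magnitude(0.9)
fast = peak_absolute_magnitude(1.5)
print(f"Slow decliner: M = {slow:.2f}; fast decliner: M = {fast:.2f}")
```

A supernova that strays from the line, as a low-mass explosion might, would be assigned the wrong intrinsic brightness and therefore the wrong distance, which is why outliers are discarded.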

Andy Howell, a supernova watcher at 
Las Cumbres Observatory, thinks type Ia 
supernovae could still be reliable tools for 
cosmology if astronomers could separate the 
different varieties of type Ia that are now 
lumped together. “If we knew there were two 
populations, we could make the measure- 
ments even better,” he says. 

So far, astronomers can’t say how many of 
their favorite cosmic explosions are sparked 
by white dwarf pairs rather than a giant and 
a dwarf. “It’s too early to say with certainty 
what that fraction is,” Ruiter says. But the
coming years could bring more clarity. 

Survey telescopes that scan the skies 
nightly or even hourly are catching more 
and more supernovae. The current frontrun- 
ner, the Zwicky Transient Facility in Califor- 
nia, spots about 30 supernovae per night. 
Its output will be dwarfed in 2022 with the 
opening of the Vera C. Rubin Observatory, an 
8.4-meter survey telescope in Chile that is 
expected to find thousands of supernovae 
nightly. Other telescopes able to obtain spec- 
tra from thousands of objects simultane- 
ously will enable astronomers to study the 
explosions for the features—the blue flash, 
the hydrogen absorption lines—that could 
betray the involvement of a giant star. 

Shen and Gaensicke hope the next data re- 
lease from Gaia will contain more high-speed 
white dwarfs fleeing from D6 explosions. And 
the Laser Interferometer Space Antenna, an 
orbiting gravitational wave detector due for 
launch in 2034, will be able to sense white 
dwarf pairs as they spiral in toward merger, 
giving astronomers a better idea of how common
they really are. “It’s a real golden age for
supernovae because we’re finding so many,”
Howell says. “We’ve now finally got the tools 
to see them in new ways.” 
Blue carbon from the past forecasts the future


Coastal mangrove ecosystems must rise above sea level to survive 


By Catherine E. Lovelock 


When knowledge of a system is imperfect, thresholds are extremely useful tools for decision-makers. Sea level rise (SLR) threatens intertidal coastal wetlands because these aquatic plants will drown as water depths exceed their physiological tolerance limits. For mangroves (trees and shrubs that grow in tidal waters of the tropics or subtropics) the threshold for SLR has been elusive. On page 1118 of this issue, Saintilan et al. (1) deduce that the threshold SLR for mangrove ecosystems is 6 to 7 mm/year. This discovery can help inform decisions on how to sustain mangroves, which provide 200 million coastal people with essential ecosystem services. These include protection from intense storms and waves, reduction in the impact of coastal flooding, sequestering of carbon, improvement in water quality, and preservation of biodiversity and fisheries (2).

1050 · 5 JUNE 2020 · VOL 368 ISSUE 6495


sciencemag.org SCIENCE 




Saintilan et al. studied ancient mangrove 
sediments to infer a threshold of relative 
SLR for mangrove tolerance. The term “rela- 
tive” is used because the rate of SLR is deter- 
mined not only by the increase in water vol- 
ume of the oceans but also by subsidence or 
uplift of coastal land. Subsidence and uplift 
result, in part, from glacial isostatic adjust- 
ment, which describes a complex set of geo- 
physical processes that vary geographically. 
The authors used sediment cores to collate 
the dates when mangrove vegetation first ap- 
peared in a variety of locations over the past 
10,000 years. Mangroves can be detected
from their distinctive, highly organic sedi- 
ments, which reflect mangrove ecosystems’ 
great productivity and conditions that favor 
preservation of vast quantities of organic 
carbon. The data obtained from sediment 
cores, sampled by a range of researchers 
from 78 sites on five continents, indicate that 
mangrove ecosystems did not develop unless 
relative SLR was less than 6 to 7 mm/year. 

Sea level rise threatens coastal mangrove forests. Disruption of these ecosystems endangers coastal communities and decreases carbon sequestration.

The global mean rate of SLR is now 3.4 mm/year and is projected to exceed the threshold and reach ~10 mm/year by 2100 under “business-as-usual” scenarios (greenhouse gas concentration trajectory Representative Concentration Pathway 8.5) (3). However, SLR could stabilize at ~5 mm/year by the year 2100 under moderate-emissions scenarios in which countries reduce carbon dioxide (CO₂) emissions (3). Clearly, reducing CO₂ in the atmosphere is a first line of defense against passing the relative SLR threshold for mangrove persistence. However, in many mangrove locations, rates of relative SLR are already higher than 6 to 7 mm/year. Now, in the Anthropocene, subsidence of coastal land is also caused by extraction of oil, gas, and water (4). For example, the Mekong Delta of Vietnam is subsiding at a rate of 6 to 20 mm/year (5) and the Ganges-Brahmaputra Delta by 1 to 7 mm/year (6), which is accompanied by land erosion and saltwater intrusion. At the same time, sediment supply to the coast, which is vital to counteracting the effects of relative SLR as it contributes to raising the seafloor, has declined; this is because rivers are dammed and, in some cases, sediment has been mined and exported, all of which further increase the vulnerability of mangroves to SLR (7).
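
The comparison between local rates and the threshold can be made explicit. The Python sketch below adds the global mean rate to the subsidence figures quoted above and checks the sum against the 6 to 7 mm/year band. Treating relative SLR as a simple sum is a simplifying assumption made here for illustration: it ignores regional differences in ocean rise and any sediment accretion, and the function names are hypothetical.

```python
THRESHOLD_BAND = (6.0, 7.0)  # mm/year, threshold range for mangroves (from the text)
GLOBAL_MEAN_SLR = 3.4        # mm/year, current global mean (from the text)


def relative_slr(global_rate: float, subsidence: float) -> float:
    """ASSUMPTION: relative SLR is the global rate plus local land subsidence."""
    return global_rate + subsidence


def threshold_status(rate: float) -> str:
    """Classify a relative SLR rate against the threshold band."""
    low, high = THRESHOLD_BAND
    if rate < low:
        return "below"
    if rate <= high:
        return "within"
    return "above"


# Subsidence ranges in mm/year quoted in the text.
deltas = {"Mekong": (6.0, 20.0), "Ganges-Brahmaputra": (1.0, 7.0)}
for name, (slow, fast) in deltas.items():
    low_rate = relative_slr(GLOBAL_MEAN_SLR, slow)
    high_rate = relative_slr(GLOBAL_MEAN_SLR, fast)
    print(f"{name}: {low_rate:.1f} to {high_rate:.1f} mm/year "
          f"({threshold_status(low_rate)} to {threshold_status(high_rate)} the threshold)")
```

Under this simple sum, even the slowest-sinking parts of the Mekong Delta already sit above the threshold, whereas the Ganges-Brahmaputra Delta straddles it.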

Saintilan et al. also found variation 
around the mean threshold value of 6 to 7 
mm/year, indicating that local factors, such 
as subsidence and sediment supply, drive 
location-specific relative SLR thresholds for 
mangroves. Reduction in extraction of water, 
oil, and gas from floodplains and enhanc- 
ing sediment supply might avoid crossing 
the SLR threshold for mangrove persistence. 
Dam removal and the delivery of nourishing 
sediments and water to the coast are prac- 
ticed in Europe and North America but have 
little uptake in the tropics (8). Improvement 
of water quality and reduction of overex- 
ploitation will help maintain mangrove 
health by facilitating strong root growth, 
which also contributes to vertical accretion 
(9). Balancing the freshwater and sediment 
needs of mangroves that provide natural 
ecosystem services against water storage, 
irrigated agriculture, and hydropower is an 
emerging series of trade-offs that policy- 
makers must face (10). 

One key consideration in these trade-offs is that under moderate SLR, mangroves mitigate CO₂ in the atmosphere by storing it in the plants and sediment of the ecosystem (so-called blue carbon) (11). Saintilan et al. used their sediment core data to estimate the amount of CO₂ sequestered over ~1400 years of mangrove development and reached a conservative estimate of 85 Pg of stored carbon, which is larger than the estimated carbon stored in boreal peats as they developed through the same Holocene period. These historical estimates verify that sequestration of blue carbon in coastal ecosystems makes substantial contributions to the mitigation of global climate change (12).

Without sufficient sediment and root 
growth to reduce the impacts of SLR, man- 
grove cover might be sustained through 
landward migration of mangroves onto 
floodplains. This would increase carbon 
sequestration, but landward retreat is com- 
plex and can be costly for governments and 
landholders (13). The threshold provided by
Saintilan et al., combined with knowledge 
of local factors that influence relative SLR, 
provides opportunities to develop time lines 
for decisions and actions that will maintain 
mangroves in the land-seascape. 

There are caveats to consider in the new 
study. Over the late Holocene, rates of SLR 
were mostly declining, whereas Earth cur- 
rently faces accelerating SLR, which might 
alter the threshold. There could be lags in 
responses of mangrove cover to relative SLR 
that exceed the 6- to 7-mm/year threshold, 
and lags can vary with species traits and dis- 
turbance regimens, such as frequency and 
intensity of storms that exacerbate the im- 
pacts of SLR and which are predicted to in- 
tensify with climate change in many regions. 

Nevertheless, providing an evidence-based 
relative SLR threshold for mangrove survival 
can help stimulate solutions for coastal man- 
agement. If nations and communities wish 
to harness the potential of blue carbon to 
mitigate climate change and to protect mil- 
lions of people who depend on mangroves 
for shelter, flood protection, food, and fiber, 
then solutions for staying below the 6- to 
7-mm/year relative SLR threshold should 
be the goal of civil society and governments. 
With reduction in CO₂ emissions, equitable
management of river water and sediment 
flows, limitations in water and oil extraction 
on floodplains, and conservation and plan- 
ning for landward migration, mangroves can 
be maintained into the future. 
The specifics of innate
immune memory 


Innate immune cells develop specific memory 
involved in transplant rejection in mice 


By Jorge Dominguez-Andrés
and Mihai G. Netea


One of the most important traits of immune host defense against pathogens is memory, which improves survival if the same pathogen is reencountered. However, immune memory can also be deleterious, driving autoimmune diseases and the rejection of transplanted organs. Memory characteristics have been considered a fundamental property of adaptive immune cells such as T and B lymphocytes (1). However, innate immune cells such as myeloid cells and natural killer (NK) cells can also adapt to previous encounters with pathogens through epigenetic, transcriptional, and functional reprogramming, called trained immunity (2). The discovery of this innate immune memory emerged from studies with live vaccines and was described as being largely nonspecific (3). On page 1122 of this issue, Dai et al. (4) reveal that monocytes and macrophages acquire specific memory and induce organ rejection in mice, which could be prevented to improve transplantation outcomes.

Dai et al. use a series of elegant transplant 
models in different mouse strains, showing 
that the rejection of grafts is maintained in 
individuals without a functional adaptive im- 
mune response. Graft rejection was mediated 
by monocytes and macrophages, which re- 
tained their memory functions several weeks 
after the encounter with antigens from the 
donor. This specific memory response was 
dependent on paired immunoglobulin-like 
receptor-A (PIR-A), an activation receptor ex- 
pressed by B cells and myeloid lineage cells, 
which can recognize major histocompatibil- 
ity complex class I (MHC-I) molecules (see 
the figure). MHC-I expressed on the surface 
of most nucleated cells presents peptide frag- 
ments of endogenous proteins, which can be 
subsequently recognized as antigens by the 
immune cells of the recipient of a transplant 
and cause its rejection (5). Blocking or deleting
PIR-A receptors impaired the recognition
of MHC-I-peptide complexes expressed in 
the donor cells, which decreased the rejec- 
tion of transplanted hearts and kidneys and 
improved outcomes. 

This specificity of innate immune mem- 
ory in mouse myeloid cells described by 
Dai et al. is reminiscent of memory char- 
acteristics of innate immune cells of sev- 
eral invertebrates, which lack an adaptive 
immune system. These include diversifica- 
tion of genes encoding fibrinogen-related 
proteins (FREPs) in mollusks (6) and scav- 
enger receptor cysteine-rich proteins in 
echinoderms (7) or alternative splicing of 
the immunoglobulin (Ig) domain-encoding 
gene Down syndrome cell adhesion mol- 
ecule (Dscam) in insects (5). It is important 
to note that in both vertebrates and inver- 
tebrates, the Ig superfamily of molecules is 
often the pillar mediating specificity of the 
innate immune memory responses. Indeed, 
PIR-A, B cell receptors, immunoglobulins, 
and T cell receptors share a similar type of 
structure, which argues for an evolutionary 
continuum of memory specificity in innate 
and adaptive immune responses. 

Ig-based specific immune responses
complement the more primitive nonspecific 
trained immunity-mediated memory (8). 
The increased responsiveness provided by 
trained immunity after certain infections or 
vaccinations can induce protection against 
both specific and heterologous infections 
(9, 10). Yet, in conditions characterized by 
excessive immune responses, such as in- 
flammatory and autoimmune diseases and 
organ transplant rejection, enhanced innate 
immune responses can aggravate the patho- 
logical consequences of inflammation. It is 
crucial to know the roles played by innate 
immune cells in transplant rejection, in or- 
der to target it and improve survival. 

The findings of Dai et al. have several im- 
plications that go beyond transplantation. 
If monocytes and macrophages can develop 
specific memory to MHC-I-presented anti- 
gens, this reveals possible therapeutic ap- 
proaches in organ transplantation, but also 
autoimmune and inflammatory diseases. 
In addition, it is intriguing to hypothesize 
whether specific innate immune memory 
responses could also form against pathogens,
as proposed for NK cells in response
to viral infections (2). 

Macrophages are heterogeneous tissue- 
resident cells: Yolk sac- and fetal liver 
monocyte-derived tissue-resident macro- 
phages colonize different tissues during 
embryonic development (11). These are
long-lived and able to self-renew, so the 
specific characteristics of these macro- 
phages rely on their niche (7). For example, 
lung macrophages are exposed to airway 
antigens, whereas Kupffer cells in the liver 
are exposed to gut-derived molecules (12).
The phenotype of a microglial cell in the 
brain greatly differs from that of a perito- 
neal macrophage (7). The potential ability 
of these and other myeloid cell subsets to 
develop different types of immunological 
memory to antigens could offer additional 
evidence for their different functions across 
tissues and help to explain the development 


antigen-specific memory to antigens pre- 
sented by donor-derived MHC-I molecules, 
it is likely that these and other myeloid cells 
can develop immunological memory to 
self- and nonself antigens of different 
sources. Given the variety of receptors ex- 
pressed by the different types of myeloid 
cells, the implications of these mechanisms 
could encompass many processes beyond 
organ transplantation, including the re- 
sponse to pathogens, vaccines, inflamma- 
tory and autoimmune diseases, or the de- 
velopment of allergies. Dai et al. identified 
several families of polymorphic Ig super- 
family receptors that could bind to MHC-I 
molecules. The extension of this search to 
other receptors that can bind antigens of a 
different nature could lead to the identifica- 
tion of other structures with the potential 
to trigger or block antigen-specific memory 
in myeloid cells, potentially offering a new 
set of targets for immunotherapy. 


Different forms of innate immunological memory 

Some myeloid cells can develop memory. After a first stimulus, retained epigenetic changes can facilitate 
the production of cytokines after a nonspecific stimulus (trained immunity). Macrophages can also develop 
specific memory through paired immunoglobulin-like receptor-A (PIR-A) specificity to antigens presented by
major histocompatibility complex class I (MHC-I).
of different types of memory responses to
the same antigens in different locations. In 
this context, it will be crucial to address if 
the antigen-specific responses by myeloid 
cells can also be inhibitory and may be in- 
volved in the development of immunologi- 
cal tolerance leading to the lack of respon- 
siveness against harmless molecules such as 
antigens expressed by commensal bacteria, 
or self-antigens. Immunological tolerance 
prevents harmful immune responses in the 
host, whereas failure of these mechanisms 
results in tissue damage and autoimmune 
responses (13). Indeed, polymorphisms 
in genes of the human leukocyte antigen 
(HLA) system, the human version of MHC, 
are associated with celiac disease, inflam- 
matory bowel disease, rheumatoid arthri- 
tis, psoriasis, type 1 diabetes, and multiple 
sclerosis (14, 15). Additional studies have 
found genetic associations between MHC-I 
and infectious diseases such as HIV, human 
hepatitis B virus, and tuberculosis (14). 

If monocytes and macrophages develop 
DEVELOPMENT


Testicular-borne factors affect sperm fertility


Immature testicular germ cells secrete factors that influence sperm maturation in mice


By Tessa Lord and Jon M. Oatley


Male fertility relies on the genesis of
sperm in seminiferous tubules of
the testis and their maturation during
transit through the epididymis.
Mouse models with impaired development
of the most proximal region
of the epididymis, the initial segment (IS), 
possess sperm that are morphologically normal
but incapable of fertilizing an egg (1). It
has been postulated that factors synthesized 
in the testis are released into the lumen of tu- 
bules (lumicrine factors) and influence devel- 
opment and function of the IS. However, the 
identity of such factors has remained elusive. 
On page 1132 of this issue, Kiyozumi et al. 
(2) identify and characterize the first known 
lumicrine factor, the germ cell-secreted pro- 
tein neural epidermal growth factor-like like 
2 (NELL2). They demonstrate a key role for 
NELL2 in driving development of the IS of 
mice, culminating in the production of IS- 
secreted proteases that are indispensable for 
sperm processing in the epididymis and thus 
male fertility. 

The epididymis is a convoluted ductal sys- 
tem of up to 1 m in length in mice and 6 m 
in humans. Testicular sperm must transit the 
length of the epididymis to gain the capacity 
for fertilization, including activation of the 
machinery that drives motility as well as egg 
recognition and binding. Given that sperma- 
tozoa are transcriptionally and translation- 
ally silent, the process of sperm maturation 
is largely attributed to activities of epididy- 
mal epithelial cells, which can include post- 
translational modification, augmentation, 
and processing of sperm proteins. Proteomic 
comparisons of sperm collected from the 
more proximal regions of the epididymis 
(caput) with sperm from the distal region
(cauda) reveal differential expression in more 
than 50 proteins (3) as well as 77 phospho- 
protein changes (4). 

Given the importance of the epididymis in 
sperm maturation, aberrations in epididymal 
function can be catastrophic for male fertil- 
ity. Although endocrine factors such as an- 
drogens are integral for epididymal develop- 
ment during embryonic and early postnatal 
life, studies with mice indicate that lumicrine 
factors are critical for differentiation of the IS 
from postnatal day 15 (P15) to P19 (5). Indeed, 
when the efferent ductules that connect testicular
seminiferous tubules and epididymal
tubules are ligated during development, thus 
preventing the flow of lumicrine factors into 
the epididymis, the epithelial cells of the IS 
do not undergo differentiation and default 
into an apoptotic pathway (6, 7). Although 
testicular lumicrine factors likely stimulate 


and how they stimulate the epithelial cells of 
the IS to differentiate and influence sperm 
maturation. 

Kiyozumi et al. provide evidence that lu- 
micrine factors that drive IS development 
are produced by testicular germ cells (which 
produce spermatozoa). They find that NELL2 
binds directly to the ROS1 receptor, and in- 
activation of the Nell2 gene phenocopies the 
Ros1-deleted mice. Specifically, Nell2-deleted
mice have a poorly differentiated IS and pro- 
duce sperm that cannot transit the uterotubal 
junction in the female reproductive tract or 
bind the glycoprotein “shell” (called the zona 
pellucida) that surrounds the oocyte, thus 
culminating in infertility. 

Kiyozumi et al. highlight that impaired 
IS development attributed to lumicrine in- 
terruption becomes appreciable from 2 to 
3 weeks of age, coinciding with the appear- 
ance of the first spermatocytes in the testis. 


Lumicrine regulation of epididymal development and function 


Lumicrine factors, such as neural epidermal growth factor-like like 2 (NELL2), are produced by germ cells in the testis during 
maturation of the epithelial cells that line the initial segment (IS) of the epididymis through activation of c-ros oncogene 1 
(ROS1). Mature IS cells produce ovochymase 2 (OVCH2), which cleaves a disintegrin and metallopeptidase domain 3 (ADAM3) on
immature sperm to attain fertilization competency.


[Figure labels: Germ cells produce lumicrine factors (e.g., NELL2); ROS1]
IS differentiation and proliferation through 
activation of SFKs (SRC proto-oncogene fam- 
ily kinases), ERKs (extracellular signal-regu- 
lated kinases), and AMPKs (adenosine mono- 
phosphate-activated protein kinases) within 
the epithelial cells (5), the identity of these 
lumicrine factors has remained elusive. 

In 1999, an orphan receptor tyrosine kinase,
c-ros oncogene 1 (ROS1), was identified
as the first putative lumicrine receptor
that is expressed in the IS of the mouse epi- 
didymis (7). Although spermatogenesis was 
grossly normal, development of the IS in 
Ros1-deleted mice was impaired, and their
sperm were unable to fertilize an egg. This 
led to the hypothesis that factors produced 
in the postnatal testis transit to the epididy- 
mis to stimulate differentiation and devel- 
opment of the IS and drive epithelial cells 
to produce molecules essential for sperm 
maturation. Unanswered questions center 
on which testicular cell types (germ cell and/ 
or somatic cell) are responsible for synthesis 
of lumicrine factors, what these factors are, 
in the IS.
Mining testicular single-cell RNA-sequencing 
databases revealed that Nell2 expression is 
indeed restricted to spermatocytes in adult 
mice. To consolidate the idea that spermato- 
cyte-produced factors influence IS develop- 
ment, the authors demonstrate a previously 
unappreciated immaturity of the IS in geneti- 
cally sterile mice that lack advanced germ 
cells. Together, these data provide compelling 
evidence for a previously undetermined in- 
teraction between germ cells and the somatic 
cells that line the epididymis and regulate 
sperm maturation (see the figure). 

To understand how lumicrine factors 
prepare the IS epithelial cells to participate
in sperm maturation, Kiyozumi et al.
investigated the molecular events that fol- 
low NELL2-ROS1 binding. They identified 
the uncharacterized protease ovochymase 2 
(OVCH2) that is secreted by the IS epithe- 
lial cells to cleave and thus activate a key 
protein involved in sperm-egg binding [A 
disintegrin and metallopeptidase domain 3 
(ADAM3)]. Thus, as is the case with Nell2-,
Ros1-, and Adam3-deleted mice, Ovch2-deleted
mice produced morphologically
normal sperm that are unable to transit the 
uterotubal junction in the female reproduc- 
tive tract or bind the egg zona pellucida for 
fertilization. 

The NELL2-ROS1-OVCH2-ADAM3 pathway
characterized by Kiyozumi et al. is an
eloquent example of lumicrine function that 
is likely to catalyze characterization of other 
similar molecular cross-talk networks that 
regulate male fertility. Certainly, it is pos- 
sible that OVCH2 cleaves other key proteins 
in the spermatozoon that aid in its ascent to 
become fertilization competent. Further, it is 
likely that other lumicrine factors are at play 
to stimulate epididymal maturation and the 
secretion of alternate protein processing en- 
zymes that drive sperm maturation during 
epididymal transit. 

The characterization of lumicrine sig- 
naling between the germ 
cells and epididymis could 
profoundly alter the under- 
standing of male infertility 
and, further, stimulate the 
development of new ap- 
proaches for treating infertil- 
ity or devising nonhormonal 
male contraceptives. In up to 
80% of male infertility cases, 
sufficient numbers of sperm 
are produced, yet functional- 
ity is compromised (8, 9). A 
major cause of such fertiliza- 
tion failures is an inability of 
the sperm to recognize and 
bind to the zona pellucida 
(10), similar to the phenotype
of lumicrine-deficient
mouse models. Given that the causes of up 
to 60% of male infertility cases are unknown 
(8), exploring defective “lumicrinology” will 
provide a new avenue for diagnosis and treat- 
ment. Moreover, targeting the lumicrine sys- 
tem may present an efficacious strategy for 
identifying druggable targets for the long 
sought-after male contraceptive pill. 


REFERENCES AND NOTES 


1. C.-H. Yeung, E. Sonnenberg-Riethmacher, T. G. Cooper, Biol. Reprod. 61, 1062 (1999).
2. D. Kiyozumi et al., Science 368, 1132 (2020).
3. M. A. Baker, L. Hetherington, G. Reeves, J. Miller, R. J. Aitken, Proteomics 8, 2312 (2008).
4. M. A. Baker et al., J. Proteome Res. 10, 1004 (2011).
5. B. Xu, A. M. Washington, B. T. Hinton, Biol. Reprod. 95, 15 (2016).
6. B. Xu, L. Yang, R. J. Lye, B. T. Hinton, Biol. Reprod. 83, 807 (2010).
7. T. T. Turner, D. S. Johnston, J. N. Finger, S. A. Jelinsky, Biol. Reprod. 77, 165 (2007).
8. M. Punab et al., Hum. Reprod. 32, 18 (2017).
9. M. G. Hull et al., Br. Med. J. (Clin. Res. Ed.) 291, 1693 (1985).
10. K. A. Redgrove, R. J. Aitken, B. Nixon, in Binding Protein, K. Abdelmohsen, Ed. (IntechOpen, 2012), pp. 73-122.


10.1126/science.abc2732 




GRAPHIC: A. KITTERMAN/SCIENCE 


OCEANOGRAPHY 


Deep-ocean seafloor islands of plastics 


The processes controlling sediment transport also concentrate microplastics 


By David Mohrig 


One of the most striking images of
ocean pollution is that of the patches or
islands of floating plastic debris,
concentrated in open-ocean gyres (1) and
large enough to be seen from space.
These concentrations of plastics on
the ocean surface were first recognized in 
the early 1970s (2). Even with the impres- 
sive size of these patches, mass-balance 
estimates for ocean-borne plastics pointed 
toward a sink. That sink has recently been 
shown to be deposition on the deep seafloor 
(3). On page 1140 of this issue, Kane et al. 
document the occurrence of enriched zones 
or islands of plastic debris that accumulate 
on the seafloor of the deep ocean (4). 

Very different styles of plastics transport 
are associated with sea-surface and sea- 
floor pollution. These styles are well un- 
derstood for sediment transport on Earth’s 
surface, which in turn are linked to surface 
evolution (5). Kane et al. document that 
the accumulation of microplastics on the 
Mediterranean seafloor is not simply pas- 
sive vertical settling through the water col- 
umn as has been commonly assumed, but 
rather represents reworking by deep-sea 
currents. These concentrate microplastics 
into patches or islands at predictable lo- 
cations if the microplastics are treated as 
sediment that can be eroded, transported, 
and deposited by a deep-ocean flow field. 
The same processes that control patterns 
of erosion and deposition on terrestrial 
landscapes and shallow-marine shelves also 
pertain to plastics in the deeper ocean. The 
authors add much needed confirmation 
that concepts and methods developed from 
easier-to-access environments can be ap- 
plied with confidence to remote undersea 
inquiries (5). 

Although the primary focus of Kane et al. 
is to demonstrate the fate and focusing of 
pollutants in the deep sea, the authors also 
identify a likelihood for colocated hotspots 
in microplastic concentration and deep- 
ocean biodiversity. The same currents driv- 
ing the enrichment of microplastics are ef- 
ficient conveyers of nutrients and dissolved 
oxygen to the seafloor. Positioning of the 
colocated hotspots is affected by the submarine
topography that guides the deep-ocean
currents. This linkage is analogous
to those already identified between surface 
transport and food webs on terrestrial land- 
scapes (5, 6). This observation opens an 
opportunity to connect the spatial struc- 
ture of deep-water ecosystems and pollut- 
ants to the spatial structure of surrounding 
submarine landscapes. Understanding the 
connections between seafloor pollution by 
microplastics and deep-sea ecology requires 
a unified science of Earth’s surface dynam- 
ics that remains one of the great integrating 
challenges of environmental studies. 
Determining the fate of plastic debris 
transported from land to sea (7) contrib- 
utes to quantifying mass fluxes defining 
Earth’s source-to-sink system. Research on 
sediment-routing systems (8, 9) has already 




identified many of the complications and 
pitfalls inherent to tracking particles across 
the environment. Sediment-flux signals can 
be phase-shifted, lagged, and buffered by 
the internal dynamics of the transport sys- 
tem. These same dynamics will confound 
analyses of routing signals for plastics 
across Earth’s surface and into the deep- 
ocean sink. Kane et al. demonstrate that 
microplastics deposited in this deep-sea en- 
vironment are still subject to later erosion, 
transport, and redeposition due to time and 
space variations in the near-bed velocity 
fields of thermohaline or contour currents. 
Understanding the ultimate fate of these 
microplastics requires high-resolution sub- 
marine topography or bathymetry because 
the deep-sea currents interact with this to- 
pography to produce the spatial changes in 
velocity that set patterns of sediment and 
microplastic erosion, transport, and accu- 
mulation. Unfortunately, high-resolution 
bathymetric data simply do not exist at the 
global scale. The most widely used seafloor 
dataset is the General Bathymetric Chart of 
the Oceans, available on a 15-arc sec world 
grid (~463-m resolution at the equator). It 
is still surprising that this worldwide ocean 
product exists at a resolution poorer than 
that of the 200-m digital elevation model
available for the entire surface of Mars (10).
Until ocean-basin scale models improve, our 
detailed understanding of solids transport 
at and near the seafloor will be restricted 
to local or regional studies connected with 
higher-resolution bathymetric models. 
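
The grid spacing quoted above can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes a spherical Earth with the WGS84 equatorial radius (a value not given in the text), and the function name is hypothetical. It converts an angular grid step in arc seconds into an east-west ground distance, which shrinks with the cosine of latitude.

```python
import math

# Assumption: WGS84 equatorial radius in metres (not stated in the article).
EQUATORIAL_RADIUS_M = 6_378_137.0


def arcsec_to_metres(arcsec: float, latitude_deg: float = 0.0) -> float:
    """East-west ground distance spanned by `arcsec` of longitude.

    Uses a simple spherical approximation; the spacing shrinks with
    cos(latitude) because meridians converge toward the poles.
    """
    angle_rad = math.radians(arcsec / 3600.0)  # arc seconds -> degrees -> radians
    parallel_radius_m = EQUATORIAL_RADIUS_M * math.cos(math.radians(latitude_deg))
    return parallel_radius_m * angle_rad


if __name__ == "__main__":
    # 15 arc seconds at the equator versus the 200-m Mars model cited above.
    spacing = arcsec_to_metres(15.0)
    print(f"15 arc sec at the equator: {spacing:.1f} m")
    print(f"ratio to a 200-m grid: {spacing / 200.0:.2f}")
```

Under these assumptions a 15-arc sec step comes out at roughly 460 m at the equator, in line with the figure quoted for the global ocean grid and a little more than twice the spacing of the 200-m Mars model.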

Kane et al. identify extremely high con- 
centrations for microplastics in deep-sea 
sediment drift deposits (11). This style of
deposit has been the focus of previous stud- 
ies because relatively rapidly accumulat- 
ing drift deposits constitute an excellent 
archive of proxy measures for past Earth 
states (12). This correlation of highest
plastic concentrations with high-fidelity 
paleoenvironmental records suggests the 
potential for a Global Boundary Stratotype 
Section and Point (13), if the Anthropocene 
is formally recognized as a geologic epoch 
(14). The first occurrence of microplastic 
in core from drift deposits could serve as a 
“golden spike” recording the lower bound- 
ary or beginning of the proposed epoch (14, 
15). In less than 80 years, plastic debris has 
been effectively distributed across Earth’s 
surface. Kane et al. demonstrate that even 
in the deep sea, its motion and accumulation
are dominated by the feedbacks between
fluid flow, sediment transport, and topog- 
raphy that are an overarching hallmark of 
Earth’s surface system. 
CHEMICAL PHYSICS


The link between electrolytes and metals 


Photoelectron spectroscopy maps a gradual transition for solvated electrons in ammonia 


By Christine M. Isborn 


Solutions of alkali metals in liquid ammonia
have fascinated scientists for
more than 200 years. The “fine blue 
colour” of a dilute solution indicative 
of solvated electrons was first noted 
by Sir Humphry Davy in 1808 [(1), p. 63]
and independently published by W. Weyl
in 1864 (2). A bronze sheen as the solution 
becomes highly concentrated (see the figure) 
develops as these solvated electrons coalesce 
into a metallic continuum. Charles Kraus, an 
early pioneer of the study of these solutions 
(3), noted that “these solutions...constitute a 
link between electrolytes, on the one hand, 
and metals, on the other” [(4), p. 83]. On page 
1086 of this issue, Buttersack et al. (5) com- 
bine low-temperature x-ray photoelectron 
spectroscopy with high-level simulations to 


reveal the energetics of metal-ammonia so- 
lutions across a large concentration range. 
These studies provide the missing energetic 
link to characterize the journey of solvated 
electrons from electrolyte to metal. 

Electrons are usually either relatively lo- 
calized in atomic or molecular orbitals or 
can be delocalized in the energy bands of 
extended solids. For the much less common 
case of solvated electrons—which exist in a 
number of solvents, including water, but are 
most stable in ammonia—questions remain 
about the extent of localization and the de- 
gree of association with the parent ions and 
surrounding solvent. The solvent environ- 
ment may change to support these species, 
and as the concentration increases, the elec- 
trons may interact to become spin-paired 
dielectrons and ultimately coalesce into a 
metallic state. 


A fascinating color change 


Alkali metal-ammonia solutions change from blue to bronze as the concentration of the metal is increased. 
This color change marks the shift of the solution from electrolyte to metallic. The photoelectron 
spectroscopy experiments by Buttersack et al. across the concentration range show how the electron
gradually changes from localized to metallic.


[Figure labels, in order of increasing metal concentration: Electrolyte solution (solvated electrons): A localized peak arises from the solvated electron. A similar feature appears for the dielectron. Metallic solution (metallic electrons; conduction band): At higher concentrations, metallic features appear in the spectrum at concentrations well below the visual color change to bronze.]

These questions have motivated many
studies of hydrated and ammoniated elec- 
trons over the years (6, 7). One of the most 
effective techniques for studying electronic 
energy levels is photoelectron spectroscopy, 
which measures the kinetic energy of elec- 
trons emitted after a substance is irradiated 
with light (based on the principles of the pho- 
toelectric effect), revealing electronic binding 
energies. This technique has been used to 
characterize hydrated electrons and ammo- 
niated electrons, albeit at low concentrations, 
well below the electrolyte-to-metal transition. 
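
The relation behind this measurement can be written in one line. The form below is the textbook photoelectric energy balance in its simplest version, ignoring instrument-specific corrections such as a spectrometer work function; the symbols are chosen here for illustration and are not taken from the article.

```latex
% E_b   : electron binding energy (relative to the vacuum level)
% h\nu  : energy of the incident photon
% E_kin : measured kinetic energy of the emitted electron
E_{\mathrm{b}} = h\nu - E_{\mathrm{kin}}
```

With a known photon energy, each measured kinetic energy therefore maps directly onto a binding energy, which is why a peak in the photoelectron spectrum can be read as an electronic energy level.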

The development of a new apparatus en- 
abled the application of x-ray photoelectron 
spectroscopy to a liquid ammonia microjet 
(8), allowing Buttersack et al. to collect the 
photoelectrons from the volatile, polar re- 
frigerated metal-ammonia solution across 
the wide concentration range of 0.012 to 9.7 
mol % metal. Over this range, the solution 
color changes from a lighter to a deeper blue 
and finally develops a bronze metallic sheen. 
The photoelectron spectra reveal the onset 
and growth of a peak beginning at a concen- 
tration of 0.08 mol % metal (see the figure). 
The energy of this peak is independent of 
the identity of the alkali metal in the solu- 
tion, which suggests that the peak arises only 
from the ammoniated electron and that the 
metal parent ion does not play a direct role 
in this transition. 

As the concentration of the alkali metal 
increases, this peak gradually grows into a 
metallic conduction band with a sharp Fermi 
edge, and an additional plasmon peak ap- 
pears. This plasmon peak is responsible for 
the characteristic bronze color of the metallic 
solution. This gradual transition is in contrast 
to the sharp transition proposed in recent 
work on metal-ammonia nanodroplets (9), 
which shows that the bulk liquid and small 
solvent clusters support different solvated 
electron structures at higher concentrations 
and therefore different metallic onsets. 

Modeling by Buttersack et al. comple- 
ments the photoelectron spectroscopy data. 
They apply a metallic free-electron gas 
model to fit the growth of the conduction 
band and the sharp Fermi edge in the pho- 
toelectron spectra as the solution undergoes 
the electrolyte-to-metal transition. These 
metallic spectral features begin to reveal
themselves at concentrations lower than 
when the solution begins to visually appear 
metallic bronze. 
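
As a rough illustration of what a free-electron gas description involves, the sketch below uses the standard textbook expressions: a Fermi energy set by the electron density, and an occupied density of states given by a square-root band multiplied by the Fermi-Dirac distribution. It is not the fitting procedure used by Buttersack et al., whose details are not given here; the function names, the temperature, and the test density are assumptions made for the example.

```python
import math

# CODATA constants (SI units).
HBAR = 1.054571817e-34   # reduced Planck constant, J s
M_E = 9.1093837015e-31   # electron mass, kg
K_B = 1.380649e-23       # Boltzmann constant, J/K
EV = 1.602176634e-19     # joules per electronvolt


def fermi_energy_ev(electron_density_m3: float) -> float:
    """Free-electron Fermi energy, E_F = hbar^2 (3 pi^2 n)^(2/3) / (2 m), in eV."""
    k_f_squared = (3.0 * math.pi ** 2 * electron_density_m3) ** (2.0 / 3.0)
    return HBAR ** 2 * k_f_squared / (2.0 * M_E) / EV


def occupied_intensity(energy_ev: float, e_fermi_ev: float, temperature_k: float) -> float:
    """Relative occupied density of states at `energy_ev` above the band bottom.

    sqrt(E) free-electron density of states times the Fermi-Dirac occupation;
    the occupation term is what produces a sharp edge at E_F at low temperature.
    """
    if energy_ev < 0.0:
        return 0.0  # no states below the bottom of the band
    kt_ev = K_B * temperature_k / EV
    x = (energy_ev - e_fermi_ev) / kt_ev
    occupation = 0.0 if x > 700.0 else 1.0 / (1.0 + math.exp(x))  # guard exp overflow
    return math.sqrt(energy_ev) * occupation


if __name__ == "__main__":
    # Sanity check with a textbook electron density for copper (not from the article).
    e_f = fermi_energy_ev(8.47e28)
    print(f"E_F = {e_f:.2f} eV")
    # Hypothetical temperature of 200 K, chosen only to show the edge shape.
    for e in (0.5 * e_f, e_f - 0.05, e_f, e_f + 0.05):
        print(f"E = {e:.2f} eV -> intensity {occupied_intensity(e, e_f, 200.0):.3f}")
```

In this picture the band widens as the electron density rises, since the Fermi energy scales as the density to the power 2/3, which is one simple way to think about a conduction band that grows gradually with metal concentration.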

On the dilute side of the concentration 
range (only one solvated electron or one 
solvated dielectron), ab initio molecular dy- 
namics provide structures of the electron and 
dielectron solvated by ammonia molecules. 
These structures are then used for high-level 
vertical dissociation energy computations. 
Performing ab initio molecular dynamics of 
an excess electron or dielectron in bulk am- 
monia is a substantial computational feat 
and advances the field beyond previous static 
cluster calculations to reveal a diffuse ammo- 
nia solvation shell that is similar for both the 
electron and the dielectron. The spin densi- 
ties suggest that the ammoniated electron 
resides within a cavity that is less structured 
than that of the hydrated electron. The high- 
level vertical dissociation energy calculations 
show that the energies to ionize the electron 
and the dielectron both fall within the mea- 
sured photoelectron signal. 

With greatly increased computational re- 
sources, ab initio molecular dynamics simu- 
lations could be expanded to a larger scale 
to include metal ions dissolved in ammonia 
at a variety of concentrations. Studies such 
as these may be necessary to resolve some 
of the remaining controversy of the localized 
versus delocalized nature of the hydrated 
and ammoniated electron (10-13). These re- 
sults would provide additional atomistic and 
electronic details of the electrolyte-to-metal 
transition. For now, the spectroscopic stud- 
ies by Buttersack et al. provide the missing 
energetic and spectroscopic link and reveal 
the gradual transition to metallic behavior 
before our eyes can see it. 
Unblinding with infrared nanosensors


Gene therapy and nanotechnology come together to fight degenerative blindness


By Katrin Franke and Anna Vlasits 


Many cases of blindness result from
progressive loss of photoreceptors, 
which are the light-sensing cells in 
the eye. For individuals with such 
progressive blindness, potential 
therapies aim at restoring vision by 
making the retina light-sensitive again while 
minimally interfering with any healthy pho- 
toreceptors—goals that are usually contradic- 
tory. Many current therapeutic strategies in- 
terfere with remaining vision, making them 
primarily suitable for patients who have lost 
all light sensitivity. On page 1108 of this issue, 
Nelidova et al. (1) present a potential solution to
this conundrum: making the retina sensitive 
to infrared light, which is largely undetect- 
able by human photoreceptors. They use en- 
gineered nanoparticle sensors and gene ther- 
apy to induce infrared light sensitivity in mice 
with inherited degenerative blindness and in 
postmortem human retinas. This approach 
might avoid damage to functional photore- 
ceptors by preventing saturation or hyper- 
activation while inducing light sensitivity in 
patients with partial retinal degeneration. 

At the level of the retina, a multilayered ar- 
ray of more than 100 types of neurons sorts 
complex visual features, such as motion and 
color, into separate channels to send to the 
brain (2). When photoreceptors fail, the en- 
tire downstream network is affected, and 
restoring the visual system’s physiological 
function becomes challenging. In addition, 
mammalian rod and cone photoreceptors, 
unlike those of some species in the animal 
kingdom, cannot regenerate. For patients 
with degenerative blindness, the aim of ther- 
apy is therefore twofold: to slow degenera- 
tion while preserving remaining vision, and 
to restore some vision once degeneration is 
complete. Several types of therapies show 
promise in slowing degenerative blindness, 
including targeted gene therapies and stem 
cell-based approaches (3, 4). One example 
is Leber congenital amaurosis, a disease for 
which a gene therapy has led to an improvement
in patients’ visual function (5). Other
therapeutic strategies include electrical im- 
plants and optogenetic gene therapies, which 
aim at restoring vision (4, 6). 

The optogenetic approach induces the 
expression of light-sensitive ion channels 
through gene therapy to restore light sensi- 
tivity to retinal neurons. Several molecular 
candidates can restore light sensitivity in ani- 
mals and in postmortem human retinas (7), 
and some are now in clinical trials. But cur- 
rently these molecules require much more 
light than normal photoreceptors to become 
activated, and most therapies would require 
video goggles to boost the effective bright- 
ness of incoming images. For patients with 
some remaining vision, this strategy would 
saturate or possibly even damage their re- 
maining functional photoreceptors. 

Nelidova et al. circumvent these problems 
by making the retina sensitive to infrared 
light, which is light beyond the visible spec- 
trum and emitted by, for example, warm ob- 
jects. Their approach combines gene therapy 
with the use of gold nanorods, an emerging 
nanotechnology for activating molecules 
in the human body (8). Nelidova et al. use 
gold nanorods as antennae for infrared light, 
transforming the light into heat through a 
process called surface plasmon resonance. 
Genetic constructs injected into the eye then 
cause the expression of temperature-sensitive 
transient receptor potential (TRP) channels 
in photoreceptors. Such TRP channels are 
normally found in mammalian heat-sensing 
nerves in the skin, as well as in the infrared- 
sensing organs of some snakes and vampire 
bats, and are able to transform heat into elec- 
trical changes in the membranes of cells (9). 
The authors use antibodies to link the heat- 
emitting gold nanorods to heat-sensitive TRP 
channels. Thus, infrared light can activate 
photoreceptors (see the figure). 

To test whether this strategy can restore 
visual function, Nelidova et al. express TRP 
channels in cone photoreceptors of a mouse 
model of degenerative blindness. They find 
that neural activity measured in the retina 
and visual cortex correlated with infrared 
light stimuli. In addition, they show that 
treated blind mice can use their infrared light 
sensitivity to learn a simple visually guided 
behavior. The authors tested their nanorod-
TRP channel approach in cultured, light-in- 
sensitive postmortem human retinas, dem- 
onstrating that it introduces infrared light 
sensitivity to this tissue—a critical step in 
evaluating its relevance for human patients. 
A different nanotechnology, called up- 
conversion nanoparticles, which binds to 
photoreceptors and “up-converts” infrared 
into visible light, can make photoreceptors 
virtually infrared-sensitive (10). Although de- 
generating photoreceptors would likely not 
be able to use up-conversion nanoparticles, 
mice treated with this technology used their 
infrared sensitivity to perform complex vi- 
sual tasks including shape recognition. Such 
detailed behavioral evaluation is critical, be- 


as well as to provide long-term effectiveness 
(14). More specifically, because nanorods and 
TRP channels cannot currently be targeted 
selectively to degenerating photoreceptors, 
the interaction between induced infrared 
sensitivity and the intrinsic light sensitivity 
of healthy photoreceptors requires further 
investigation. In addition, many objects that 
humans see are not necessarily infrared- 
emitting or infrared-reflecting; thus, goggles 
to convert visible light to infrared light would 
likely be necessary. Nonetheless, this system 
has exceptional promise for basic research. 
Tools that can reintroduce light sensitivity to 
postmortem human retinas (15) offer the potential
for studying human retinal function
in much greater detail than was previously 


Nanorods and heat-sensing proteins for infrared detection 
Injecting the eye with infrared-sensitive nanorods and genetic constructs to induce the expression of 
temperature-sensitive transient receptor potential (TRP) channels in photoreceptors may confer infrared vision. 


Retina 
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“ ganglion —+ 
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order 
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photoreceptors | 
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to the brain 


cause it is not clear to what extent the already- 
developed brain can interpret a new sensory 
modality to guide behavior—although studies 
support some plasticity of the adult mam- 
malian brain for integrating new sensory in- 
put (7/7, 12). Both examples demonstrate the 
strengths of nanotechnology tools over other 
methods. These tools are more light-sensitive 
than conventional optogenetics, approaching 
the sensitivity needed to work under normal 
daylight levels. Because the nanoparticles 
harness a different wavelength of light, it 
might be possible for normal vision and in- 
frared vision to operate in parallel. 

The nanorod-TRP channel approach used 
by Nelidova et al. faces further challenges 
before it can reach the clinic. It is promising 
that gold nanorods have, so far, appeared to 
be safe in humans (13). Similarly, ocular gene 
therapies seem to be low-risk and effective (3, 
4). However, the main challenge of any ocu- 
lar gene therapy is to improve the efficiency 
and completeness of gene introduction (3) 
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Photoreceptors can become degenerated, 
making them no longer sensitive to light. 

This renders individuals partially sighted 

or blind because photoreceptors cannot 
activate second- and third-order neurons to 
send visual information to the brain. 


2 1 Gold 
nanorods 
C +) 2 Heat 


Photoreceptor 
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1 Antibodies link infrared-sensitive 
nanorods to temperature-sensitive 
TRP channels expressed by 
photoreceptors through gene therapy. 

2 Nanorod-emitted heat opens the 
TRP channels. 

3 This introduces a current in 
the photoreceptors that activates 
the downstream circuitry. 


possible. This basic knowledge is important 
for any approach to vision restoration, as it 
would reveal what kinds of functions need to 
be restored. 
SYNTHETIC BIOLOGY 


Follow the barcoded microbes 


Genetically engineered spores give object provenance technology new avenues 


By Jeff Nivala 


Tracking where a physical object originated 
and where it has been, known as object 
provenance, is becoming increasingly important 
with the globalization of supply chains (1). 
Object-labeling technologies that are scalable, 
robust, and difficult to falsify would help, 
for example, to mitigate foodborne illness 
outbreaks and to curb the manufacture of 
counterfeit goods. However, there 
currently exists no single tagging method 
that satisfies all these requirements. On 
page 1135 of this issue, Qian et al. (2) de- 
scribe a new tagging technique, called the 
barcoded microbial spores (BMS) system, 
that uses genetically engineered microbes as 
molecular tags to address the object prov- 
enance problem. The authors demonstrate 
that BMS can label a range of surfaces and 
persist for months in real-world conditions. 
Furthermore, they show how this technol- 
ogy can tag objects that come into only brief 
contact with BMS-labeled surfaces, suggest- 
ing the utility of BMS in forensic surveil- 
lance applications (3). 

Microbes are ubiquitous in our everyday 
environments, with different geographical 
locations having distinct microbial popula- 
tions. Physical objects can even take on the 
microbial signature of their environments 
over time (4, 5), which has led to the sugges- 
tion that this naturally occurring signature 
could be used for object provenance (6). 
Although it would be convenient to have 
environmental microbes acting as auto- 
matic label-makers, actually implementing 
such an approach would pose a number of 
challenges. For instance, extensive environ- 
mental mapping of microbes would have to 
be done first. This would be expensive and 
time-consuming. Furthermore, en- 
vironmental microbial abundances 
can be similar at disparate locales 
and also change over time, making 
the use of natural microbe labels a 
nonstarter for many applications. 

Instead, Qian et al. take this con- 
cept into the realm of synthetic 
biology by engineering synthetic 
strains of microbes to function as 
molecular labels. To do this, they in- 
serted short, specific DNA sequences 
(barcodes) into the genomes of the 
bacterium Bacillus subtilis and the 
yeast Saccharomyces cerevisiae. 
They designed and experimentally 
validated dozens of barcode se- 
quences that could be used in com- 
binations to generate an essentially 
infinite number of potential tag sets. 
To tag an object, a set of barcoded 
strains is mixed together in spore 
form and sprayed on the surface of 
the object. A spore is a physically 
tough, inactive cell state that can 
persist in harsh environmental conditions 
without growth (7). This resilience allows 
the barcoded microbial spores to endure in 
diverse ecosystems without rupturing and 
exposing their DNA barcodes to the ele- 
ments, risking tag loss. 
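
To give a feel for why combinations of a modest barcode library yield so many distinct tags, the short sketch below counts the possible tag sets. The library sizes used here are illustrative assumptions, not the numbers reported by Qian et al., and the function names are hypothetical.

```python
from math import comb


def count_tag_sets(n_barcodes: int, set_size: int) -> int:
    """Number of distinct tag sets of a fixed size drawn from a barcode library."""
    return comb(n_barcodes, set_size)


def count_all_tag_sets(n_barcodes: int) -> int:
    """Number of non-empty tag sets of any size (every non-empty subset)."""
    return 2 ** n_barcodes - 1


if __name__ == "__main__":
    # Hypothetical library sizes, chosen only to show how quickly the count grows.
    for n in (12, 24, 48):
        half = n // 2
        print(f"{n} barcodes: {count_tag_sets(n, half)} sets of size {half}, "
              f"{count_all_tag_sets(n)} sets of any size")
```

The count is finite but grows exponentially with library size, which is the sense in which the number of tag sets is "essentially infinite" for practical purposes.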

To identify the barcodes, the surface of 
the object is swabbed to collect a sample, 
which is prepared for input to one of a 
number of different decoding devices, in- 
cluding a DNA sequencer or a quantitative 
polymerase chain reaction machine. In 
addition to these readout 
options, Qian et al. also 
focused on using a nascent 
DNA detection technol- 
ogy called SHERLOCK (8), 
which is more amenable 
to deployment in field 
settings. This feature is 
critical to many potential 
provenance applications. 
BMS decoding should also 
be compatible with other 
field-deployable DNA- 
sensing technologies, such 
as commercial nanopore devices (9), al- 
though the authors did not explore this 
compatibility. 

[Figure: Produce provenance data encoded in the DNA of a synthetic microbe. 
Genetically barcoded spores could be used for a new generation 
of object (e.g., food) provenance applications.] 

The release of genetically modified or- 
ganisms into uncontrolled environments 
may come with risks. So, Qian et al. built 
clever safeguards into the BMS system to 
prevent the unintended spread and prolif- 
eration of their microbial tags in the envi- 
ronment, beginning with careful selection 
of the microbial species themselves. Both 
B. subtilis and S. cerevisiae are commonly 
found in the environment and in food 
samples, and products derived from them 
have already been granted “generally rec- 
ognized as safe” (GRAS) status by the U.S. 
Food and Drug Administration. To prevent 
their proliferation in native environments, 
Qian et al. used auxotrophic B. subtilis and 
S. cerevisiae strains that require supple- 
mentation of key amino acids for growth. 
The authors took additional steps to pre- 
vent the microbial spores from returning 
to an active cellular state by either remov- 
ing genes essential to this process (for 
B. subtilis) or by boiling 
the spores to permanently 
heat-inactivate them (S. 
cerevisiae). These mea- 
sures were adequate to en- 
sure that the spores did not 
replicate even in the most 
favorable laboratory con- 
ditions. As there is still the 
potential for auxotrophs 
to grow by scavenging for 
metabolites in the envi- 
ronment, these additional 
safeguards are prudent. 
An alternative strategy to biocontainment 
would be the use of so-called “recoded” cell 
strains that have been engineered to be 
dependent on completely synthetic amino 
acids that are not found in natural ecosys- 
tems (10, 11). 

The value of a provenance system de- 
pends on its lifetime and applicability to 
different objects. The BMS system was 
prototyped on an impressive number of 
surfaces and simulated environments, in- 
cluding sand, soil, carpet, and wood, in 
addition to an outdoor grass area. Notably, 
the barcoded microbes persisted 
and remained detectable on these 
surfaces for months, even after 
real or simulated weather condi- 
tions and physical perturbations 
such as vacuuming and sweep- 
ing. Beyond typical object prov- 
enance, in which the labels are 
applied directly to the object be- 
ing tracked, Qian et al. also dem- 
onstrated that BMS tags could be 
transferred to other objects that 
come in transient contact with 
a labeled surface. The authors 
found that they could even trace 
the path of shoes that had walked 
on a BMS-tagged floor. The poten- 
tial use for such a technology by, 
for example, law enforcement is 
compelling, although additional 
work is needed to determine how 
the BMS system withstands more 
rigorous real-world stresses, such 
as the application of cleaning 
agents to the surface. 

Perhaps the most powerful use of mi- 
crobial tags will come from their applica- 
tion to agricultural and other food supply 
chains. For example, the tags could safely 
be sprayed directly on food products, as 
Qian et al. showed with leafy plants (see 
the figure). Bacillus thuringiensis, which 
is closely related to B. subtilis, is already 
in common use as an insecticide for agri- 
cultural products that are commercially 
available today (12). The authors found 
that B. thuringiensis genomic DNA can 
be detected on store-bought produce even 
after washing, boiling, frying, and micro- 
waving, thus highlighting the hardiness 
of DNA-based tags. As would be expected, 
other microbe and DNA-based provenance 
systems are also being developed, in both 
academia (13) and industry. We may well 
be on a path to a brave new world in which 
food provenance tracking goes not just 
from the farm to the table, but all the way 
to the sewer (14). 

Serology assays to manage COVID-19 


Measurement of antibodies to SARS-CoV-2 will improve disease management if used correctly 


By Florian Krammer and Viviana Simon 


In late 2019, China reported a cluster of 
atypical pneumonia cases of unknown 
etiology in Wuhan. The causative agent 
was identified as a new betacoronavirus, 
called severe acute respiratory syndrome- 
coronavirus 2 (SARS-CoV-2), that 
causes coronavirus disease 2019 (COVID-19) 
(1). The virus rapidly spread across the 
globe and caused a pandemic. Sequencing 
of the viral genome allowed for the develop- 
ment of nucleic acid-based tests that have 
since been widely used for the diagnosis of 
acute (current) SARS-CoV-2 infections (2). 
Development of serological assays, which 
measure the antibody responses induced by 
SARS-CoV-2 infection (past but not current 
infections), took longer. This is in part due to 
bottlenecks with availability of positive con- 
trol sera and the need for extensive specificity 
and sensitivity testing in the context of pre- 
existing immunity to seasonal coronaviruses. 
Serological assays are important for under- 
standing the prevalence of and immunity to 
SARS-CoV-2. 

Many types of serological assays have been 
developed over the past decades to measure 
antibody responses to pathogens in bodily 
fluids, especially blood serum or plasma. 
These assays use different platforms, includ- 
ing binding assays such as enzyme-linked 
immunosorbent assays (ELISAs), lateral flow 
assays, or Western blot-based assays. In ad- 
dition, functional assays that test for virus 
neutralization, enzyme inhibition, or bacte- 
ricidal assays can also inform on antibody- 
mediated immune responses. Collectively, 
serological assays are essential tools in the 
management of infectious diseases, includ- 
ing diagnosis of infection, measurements of 
protective antibody titers upon vaccination, 
and seroprevalence assessments of immunity 
in a population. 

Serological assays for SARS-CoV-2 are 
now becoming widely available and include 
ELISAs (3-7), lateral flow assays (5, 8, 9) (see 
the figure), and virus neutralization assays. 
ELISA and lateral flow assays are performed 
with recombinant antigens, such as the spike 
protein (the main surface glycoprotein that is 
used to attach and enter cells) of SARS-CoV-2; 
the receptor-binding domain (RBD), which 
is part of the spike protein; or the viral nu- 
cleoprotein. Of note, using the SARS-CoV-2 
nucleoprotein is expected to induce more 
cross-reactivity (antibodies that bind to mul- 
tiple strains of coronavirus) than the spike 
protein, owing to sequence homology of the 
viral nucleoprotein. These assays can be han- 
dled at biosafety level 2 (and therefore can be 
carried out more widely), given the recombi- 
nant nature of the selected antigens. By con- 
trast, neutralization assays with replication- 
competent SARS-CoV-2 have to be performed 
in biosafety level 3 facilities, which limits their 
application. Safer and more high-throughput 
alternatives to using infectious virus are 
under development and include the use of 
pseudotyped viral particle assays, in which 
the SARS-CoV-2 spike protein is grafted onto 
harmless viruses or virus-like particles. 

[Figure: Quantitative and binary readouts in serology assays. 
Quantitative and binary serology tests can provide important information about infection. 
Panels: quantitative assays (e.g., enzyme-linked immunosorbent assay, ELISA); 
assay with binary result (e.g., lateral flow assay).] 

A limited number of ELISA and lateral 
flow assays have recently received emer- 
gency use authorization from the U.S. Food 
and Drug Administration (FDA). In addition, 
many lateral flow assays from different com- 
panies are available, but their usefulness is 
questionable, given the lack of official perfor- 
mance validation with respect to sensitivity 
(how many true positives are detected) and 
specificity (how many true negatives are 
correctly identified, which limits false positives) 
(9-11). Using serological assays with vali- 
dated sensitivity and specificity performance 
is critical for obtaining meaningful results. 
For some applications, such as serosurveys 
in high-prevalence populations, somewhat 
lower specificity is acceptable, whereas sensi- 
tivity should be high. For uses where a false- 
positive test result would be consequential, 
very high specificity is essential. In general, 
both sensitivity and specificity should be as 
high as possible. 

An important application of serological 
tests is to understand the antibody responses 
mounted upon SARS-CoV-2 infection and 
vaccination. Assays that inform on antibody 
titer and/or show antibody functionality 
(e.g., virus neutralization) will be extremely 
useful to answer important scientific ques- 
tions about immune protection from reinfec- 
tion. For example, do all infected individuals 
mount a robust antibody response to SARS- 
CoV-2 infection? It is unclear whether there 
is a difference in the antibody responses 
found in individuals presenting with severe, 
mild, and asymptomatic COVID-19 and how 
long antibody responses last. Moreover, it is 
unknown if the presence of binding antibody 
to the spike or RBD antigens correlates with 
virus neutralization. Whether antibody titers 
(binding or neutralizing) correlate with pro- 
tection from reinfection is also unclear. Such 
data will be important when dissecting an- 
tibody responses generated by natural infec- 
tion compared to vaccination. 

Serological testing can also inform on the 
prevalence of SARS-CoV-2 infections in dif- 
ferent populations. Although it is impractical 
to test the whole population, well-designed 
serosurveys are essential to determine how 
prevalent COVID-19 is in the general popu- 
lation, in selected subsections of the popula- 
tion (e.g., health care workers), or in specific 
risk groups. Both quantitative assays and 
assays with a binary outcome can be used 
for these surveys. Quantitative assays may 
provide more reliable results [e.g., two-step 
ELISAs (12)], but they are also harder to scale 
because they often have to be performed in 
specialized laboratories. By contrast, assays 
with binary outcomes (e.g., lateral flow as- 
says) can be easily scaled and implemented 
because they are often point-of-care tests. 
Analyses of the results of serosurveys need 
to account for the sensitivity and specificity 
of the assay used as well as the estimated 
prevalence of infections in a population. In 
addition, biological variables resulting from 
in-depth characterization of the immune 
responses such as, but not limited to, the 
duration of the immune responses and the 
dynamic nature of antibody titers linked to 
severe, mild, and asymptomatic COVID-19 
manifestations will need to be factored into 
calculating prevalence based on serosurveys. 
Currently, many of these critical variables are 
unknown, and any serosurvey analysis gener- 
ated in the immediate future should be inter- 
preted with caution. 
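
As a rough illustration of the adjustment described above, the sketch below applies the Rogan-Gladen estimator, a standard textbook correction that this article does not itself name, to convert the raw fraction of positive tests into an estimate of true prevalence. The function name and the example numbers are assumptions chosen for illustration, not values from any serosurvey.

```python
def rogan_gladen(apparent_prevalence: float,
                 sensitivity: float,
                 specificity: float) -> float:
    """Estimate true prevalence from the observed fraction of positive tests.

    Uses: true = (apparent + specificity - 1) / (sensitivity + specificity - 1),
    clamped to the interval [0, 1].
    """
    denominator = sensitivity + specificity - 1.0
    if denominator <= 0.0:
        # The assay is no better than chance, so no correction is possible.
        raise ValueError("sensitivity + specificity must exceed 1")
    estimate = (apparent_prevalence + specificity - 1.0) / denominator
    return min(1.0, max(0.0, estimate))


if __name__ == "__main__":
    # Hypothetical survey: 6.4% of samples test positive with an assay that is
    # assumed to be 90% sensitive and 98% specific.
    print(rogan_gladen(0.064, sensitivity=0.90, specificity=0.98))
```

The correction addresses only assay error; it does not account for the biological unknowns listed above, such as waning antibody titers.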

Donors for convalescent plasma therapy 
can be identified with serology testing. 
Antibody-rich plasma or serum from con- 
valescent individuals (or animals) has been 
used to treat many infections as well as 
snake bites. One of the earliest examples is 
the treatment of diphtheria with antiserum 
obtained from horses for which Emil von 
Behring received the Nobel Prize in 1901. 
More recently, antiserum has been used for 
the treatment of a range of viral infections 
(e.g., infections with Hantaan virus, Junin vi- 
rus, measles virus, and Ebola virus, as well as 
potential rabies infections). Individuals who 
recover from COVID-19 develop antibodies to 
SARS-CoV-2. During the initial stages of the 
COVID-19 epidemic in China, convalescent 
plasma therapy was used compassionately 
(13) and has since been implemented in the 
United States and elsewhere. The success of 
this intervention likely increases with the 
antibody titer of the donor. It is, therefore, 
important to screen potential convalescent 
donors so that individuals with the highest 
antibody titers can be selected. This screen- 
ing can be accomplished by measuring virus- 
neutralizing activity of the plasma, which is 
a lengthy process (several days) and needs to 
be performed in a biosafety level 3 laboratory. 
ELISA-based antibody testing that produces 
a titer is quick (hours) and easy to perform. 
Quantitative measurements of antibody ti- 
ters from at least two different ELISAs have 
been shown to correlate well with neutraliz- 
ing titers (3, 4). 

Identifying individuals who are immune is 
an important but also complex and politically 
charged application of serological assays. 
Individuals who were infected with “common 
cold” human coronaviruses develop antibody 
responses and are protected from reinfection 
for a certain period of time, likely for years 
(14). If reinfection occurs, it is often mild or 
asymptomatic. In addition, infection with 
SARS-CoV-1 was shown to induce neutraliz- 
ing antibody responses that last for several 
years (14). On the basis of these data, indi- 
viduals with antibodies to SARS-CoV-2 are 
assumed to be less susceptible to reinfection, 
reducing the risk of severe COVID-19 and also 
limiting the possibility of spreading the virus. 
Therefore, it has been proposed that indi- 
viduals with robust antibody responses could 
safely return to normal life and work, slowly 
starting the economy on a path to recovery. 
Detection of protective immune responses is 
also an important consideration for health 
care workers. In addition, people immune to 
SARS-CoV-2 could be spared from quarantine 
and social distancing measures during a po- 
tential second or third wave of SARS-CoV-2 
infections in the winter of 2020. Accordingly, 
some countries have proposed an “immune 
passport” for such individuals. 

However, there are numerous caveats that 
should be carefully considered before pro- 
ceeding. It needs to be demonstrated that 
individuals who have developed antibodies 
to SARS-CoV-2 are protected. If antibodies 
provide immunity and protection, it is not 
(yet) known how long they will persist at 
the needed titer. A person protected today 
might no longer be protected in 6 months. It 
is, therefore, a matter of urgency to conduct 
studies aimed at dissecting the magnitude, 
duration, and functionality of the immune 
responses induced by SARS-CoV-2 infec- 
tion, including antibodies, as well as cellular 
(adaptive) immune responses, and to deter- 
mine the correlation between immune re- 
sponse and protection. In the absence of this 
knowledge, decisions about deploying the 
workforce may be based on incomplete infor- 
mation and guided by incorrect assumptions. 

A known antibody titer that correlates 
with protection would also be extremely ben- 
eficial for vaccine development. Protective 
titers and/or correlates of immune protection 
have been established for many virus infec- 
tions, including influenza virus, hepatitis A 
virus, hepatitis B virus, and measles virus. 
For several of these infections, the dynamics 
of the immune responses are well understood 
and the duration of protection based on an- 
tibody titers has been successfully modeled 
(15). For these types of studies, serological 
assays that measure a quantitative antibody 
titer have been instrumental. However, when 
converting the concept of an “immune pass- 
port” to practice, point-of-care serological as- 
says that produce a binary response may also 
be useful. A combined strategic approach 
may be the safest while also being feasible. 
To account for sensitivity and false positives, 
if every positive lateral flow test result is con- 
firmed with a second test that produces a 
titer—which also indicates the robustness of 
the response and could be linked to the pres- 
ence and duration of protection—the number 
of false-positive results would be greatly re- 
duced. Such a targeted sequential approach 
would provide reliable information on im- 
munity and avoid putting individuals at risk. 
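
The effect of confirming each positive result with a second test can be sketched numerically. The example below computes the positive predictive value (the fraction of positive results that are true positives) for a single test and for a two-step scheme. It assumes the two tests err independently of each other, which real assays may not, and every performance figure and function name is a hypothetical chosen for illustration.

```python
def positive_predictive_value(prevalence: float,
                              sensitivity: float,
                              specificity: float) -> float:
    """Fraction of positive results that are true positives."""
    true_pos = prevalence * sensitivity
    false_pos = (1.0 - prevalence) * (1.0 - specificity)
    return true_pos / (true_pos + false_pos)


def sequential_performance(sens_1: float, spec_1: float,
                           sens_2: float, spec_2: float) -> tuple[float, float]:
    """Combined sensitivity and specificity when a result counts as positive
    only if both tests are positive (assumes independent errors)."""
    combined_sens = sens_1 * sens_2
    combined_spec = 1.0 - (1.0 - spec_1) * (1.0 - spec_2)
    return combined_sens, combined_spec


if __name__ == "__main__":
    prevalence = 0.05  # assumed 5% of the population previously infected
    single = positive_predictive_value(prevalence, 0.90, 0.95)
    sens, spec = sequential_performance(0.90, 0.95, 0.95, 0.99)
    confirmed = positive_predictive_value(prevalence, sens, spec)
    print(f"single test PPV: {single:.3f}, confirmed PPV: {confirmed:.3f}")
```

The trade-off is visible in the combined sensitivity, which is lower than that of either test alone: the sequential scheme removes false positives at the cost of missing a few true ones.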

Several academic laboratories have devel- 
oped robust, specific serological assays, and 
high-quality commercial options are becom- 
ing available. In accordance with academic 
grassroots traditions, a toolkit to set up an- 
tibody assays has been distributed to more 
than 200 laboratories across the world, and 
a detailed protocol to facilitate local imple- 
mentation has been published (12). With 
high-quality serological assays now avail- 
able, the key challenge will be to apply and 
deploy these tests in a strategic manner to 
safely bring communities out of the current 
pandemic response back to the realm of “nor- 
mal” life. 

Donald Kennedy (1931-2020) 


Eminent public intellectual who served science and society 


By Gretchen C. Daily and Paul R. Ehrlich 


Donald Kennedy, public servant, university 
president, and former Science editor-in-chief, 
died on 21 April. He was 88. A born naturalist, 
broad intellectual, and leader, Don engaged 
in controversial areas as diverse 
as the safety of artificial sweeteners, the 
overuse of pesticides and antibiotics, the 
impacts of human population growth, the 
teaching of evolution, the philosophical ba- 
sis for valuing nature, and academic duty. 
He served with brilliance, intense energy, 
and effectiveness. His death is a great loss 
to the United States and the world at this 
crucial moment in history. 

Born in New York City in 1931, Don re- 
ceived all of his degrees (bachelor’s, mas- 
ter’s, and Ph.D.) in biology from Harvard 
University. He taught at Syracuse University 
for 4 years and then moved in 1960 to 
Stanford University, to which he devoted 32 
years of his career. After leaving the labora- 
tory bench, he took leadership roles at the 
university, in government, and finally at the 
journal Science. 

In graduate school, Don explored the re- 
lation between brain and behavior. He de- 
vised a strategy for working on the simple 
nervous systems of invertebrate animals 
such as crayfish and lobsters—a choice 
that he liked to boast was superior to my 
(P.R.E.’s) choice of butterflies, in that Don 
could eat his specimens. He helped deter- 
mine how restricted networks of neurons 
give rise to sensory perception and behav- 
ioral acts, such as swimming and flying. His 
presumption that the same rules of wiring 
neurons together apply to crayfish and hu- 
mans has been justified time and again. 

Don’s research program advanced princi- 
ples that later became widespread and part 
of modern systems neuroscience. His an- 
thropocentric colleagues, mostly in medical 
schools, resisted his paradigm, complaining 
that scientists should be studying “real ani- 
mals.” His former student Ron Hoy remi- 
nisces that, in response, Don sardonically 
referred to his bottom-up methods based 
on invertebrates as the study of “virtual ani- 
mals.” He showed that stimulating just one 
or a few command neurons produces a com- 
plex sequence of movements. His laboratory 
was also instrumental in cell and network 
visualization techniques, using fluorescent 
dyes to define the functional connectivity 
within behavioral circuits. 

In 1973, I (P.R.E.) and psychiatrist David 
Hamburg brought Don in to help organize 
Stanford’s Program in Human Biology. Don 
soon took charge of the program and pro- 
ceeded to use his uncommonly effective per- 
suasive powers to convince me to teach for 
the program for almost a decade, despite my 
intention of staying only a few years. It was 
hard to say no to Don; he exuded genuine 
warmth, both with friends and upon first en- 
counter, drawing people in with a light touch 


on the arm. Jeanne Kennedy, his first wife, 
recalls how “he could see the best in some- 
one, who they really wanted to be...and make 
them feel as if they were that best version.” 

Don served as commissioner of the Food 
and Drug Administration from 1977 to 1979 
before returning to Stanford as provost 
and then becoming its eighth president 
from 1980 to 1992. He led the university’s 
Centennial Campaign, which brought in 
nearly $1.3 billion, the largest sum ever 
raised for higher education at the time. His 
enduring investments strengthened under- 
graduate teaching and financial aid, over- 
seas study, and public service—including the 
launch of the Bing Stanford in Washington 
Program, an undergraduate internship in 
the U.S. capital. Later, Don weathered a 
challenging disagreement with the federal 
government over research reimbursement. 
Stanford was ultimately vindicated, but the 
publicity took a heavy toll. 
In his professor days, Don was a legend- 
ary teacher and research mentor, inspiring 
large cohorts of undergraduates, graduate 
students, and postdocs, who themselves 
became leaders in science. He continued 
his mentoring as university president. 
I (G.C.D.) will never forget his addressing 
my freshman class, asking plainly, “Do you 
feel completely out of place?” and contend- 
ing, “Every one of you should be uncomfort- 
able! That’s what’ll make you grow!” He 
extended an open invitation to students to 
join him on his 6 a.m. runs and share what 
was on their minds. He would give students 
advice such as “Great plan, but you’ll never 
make a three-point shot standing right un- 
der the basket.” 

Don’s interests ranged from the practi- 
cal to the esoteric, but he always looked at 
the big picture. He wrote on wide-ranging 
subjects, from nuclear war and pandem- 
ics to “what we don’t know.” Perhaps most 
unusually, in their book Humans, Nature, 
and Birds, he and coauthor Darryl Wheye 
explored the sweeping 30,000-year history 
of the depiction of birds in art. 

During his tenure as editor-in-chief of 
Science from 2000 to 2008, Don greatly im- 
proved its coverage of ecology, evolution, 
and conservation at a time of rapidly inten- 
sifying human impacts on the biosphere. In 
doing so, he helped move problems such 
as land transformation, biodiversity loss, 
disruption of the nitrogen cycle, climate 
change, and ocean acidification to the top 
of the scientific agenda. He had contagious 
enthusiasm and boundless curiosity about 
every field the journal covered, but he was 
particularly excited when a paper touched 
on one of his personal passions. One day 
he called me (P.R.E.) and said, “We’ve just 
gotten a manuscript reporting ivory-billed 
woodpeckers in Arkansas.” We started plan- 
ning an expedition to see them, but the re- 
port of that now-extinct species turned out 
to be a false alarm, depriving us both of one 
final joint field adventure. 

It is sadly ironic that coronavirus disease 
2019 (COVID-19) killed Don. His grasp of 
the complex historical, demographic, envi- 
ronmental, epidemiological, economic, so- 
cial, and political dimensions of pandemic 
risk was incomparable. He would have been 
on the front lines, advancing a coherent 
and effective response by the United States. 
Don’s approach to every challenge was first 
and foremost based on science and evi- 
dence. He leavened this razor-sharp insight 
with a healthy sense of humor and human- 
ity that earned him respect and admiration 
and made him beloved by those whose lives 
he touched. 

Circles on the grounds of San Francisco's Dolores Park are designed to limit the spread of SARS-CoV-2 by encouraging social distancing. 

Which interventions work best in a pandemic? 


We can exploit randomized controlled trials, compartmental models, and spillovers 


By Johannes Haushofer 
and C. Jessica E. Metcalf 


The only approaches currently avail- 
able to reduce transmission of the 
novel coronavirus severe acute 
respiratory syndrome-coronavirus 2 
(SARS-CoV-2) are behavioral: hand- 
washing, cough and sneeze etiquette, 
and, above all, social distancing. Policy- 
makers have a variety of tools to enable 
these “nonpharmaceutical interventions” 
(NPIs), ranging from simple encourage- 
ment and recommendations to full-on 
regulation and sanctions. However, these 
interventions are often used without rigor- 
ous empirical evidence: They make sense 
in theory, and mathematical models can 
be used to predict their likely impact (1, 
2), but with different policies being tried 
in different places—often in complicated 
combinations and without systematic, 
built-in evaluation—we cannot confidently 
attribute any given reduction in transmis- 
sion to a specific policy. 

Because many of these interventions dif- 
fer from each other in terms of their eco- 
nomic and psychological cost (ranging 
from very inexpensive, in the case of inter- 
ventions based on behavioral economics 
and psychology, to extremely costly, in the 
case of school and business closures), it is 
crucial to identify the interventions that 
most reduce transmission at the lowest eco- 
nomic and psychological cost. Randomized 
controlled trials (RCTs) are one of several 
methods that can be used for this purpose 
but surprisingly have received little atten- 
tion in the current pandemic, despite a long 
history in epidemiology and social science. 
We describe how RCTs for NPIs can be prac- 
tically and ethically implemented in a pan- 
demic, how compartmental models from 
infectious disease epidemiology can be used 
to minimize measurement requirements, 
and how to control for spillover effects and 
harness their benefits.

JUSTIFIABLE RCTS
How can RCTs be practically and ethically 
conducted in a pandemic? In a typical RCT, 
a subset of randomly chosen individuals 
or regions receives an intervention, and a 
randomly chosen control group receives 
no intervention or a different intervention. 
The random assignment ensures that any 
later differences between the groups can be 
attributed to the intervention. During an 
outbreak, policy-makers must decide which 
interventions to impose when, and when to 
loosen them again. It will rarely be feasible 
in this context to omit individuals or regions 
entirely. However, policy-makers can use 
systematic timing of such interventions to 
both protect the population and understand 
the impact of the intervention. For example, 
when experts begin to think that measures 
can be loosened, this can be done gradually, 
so that evaluation is possible: A subset of 
randomly chosen locations (such as counties 
or municipalities) begins, and others gradu- 
ally follow suit. Comparison of the “early” to 
the “late” regions makes it possible to esti- 
mate the effects of the intervention. 
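
The staggered rollout described above can be sketched in a few lines. This is only an illustration under stated assumptions: the county names, the number of waves, and the function name `stepped_wedge_schedule` are hypothetical and do not come from the article.

```python
import random

def stepped_wedge_schedule(locations, n_waves, seed=0):
    """Randomly assign each location to one of n_waves start waves.

    Returns a dict mapping location -> wave index (0 = earliest).
    Wave sizes differ by at most one.
    """
    rng = random.Random(seed)
    shuffled = list(locations)
    rng.shuffle(shuffled)
    # Deal the shuffled locations round-robin so the waves stay balanced.
    return {loc: i % n_waves for i, loc in enumerate(shuffled)}

# Hypothetical example: 12 counties loosen a measure in 3 waves.
counties = [f"county_{k:02d}" for k in range(12)]
schedule = stepped_wedge_schedule(counties, n_waves=3, seed=42)
```

Comparing outcomes in wave 0 with those in later waves, before the later waves switch, gives the "early" versus "late" contrast. Fixing the seed makes the assignment reproducible and auditable.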

This “phase-in” or “stepped-wedge” approach
can be used at any point during the
pandemic. At the beginning, protective
measures can begin early
in some areas and somewhat 
later in others. During the pan- 
demic, periods of loosened mea- 
sures may be necessary to restore 
a sense of normality and keep 
essential services working, or 
measures may have to be tight- 
ened to limit further spread of 
the virus; these periods can also 
be systematically timed to evalu- 
ate their impact. In extended 
versions, different interven- 
tions can be tested against each 
other, and different locations can 
tighten or loosen different sub- 
sets of restrictions; for example, 
schools could be opened back up, 
whereas nonessential businesses 
remain closed. 

Governments and organiza- 
tions could work with scientists 
to choose an experimental de- 
sign, implement and keep track 
of the treatment assignment, 
and measure outcomes. Studies 
of this kind can now often be 
done in nimble and practicable 
ways, reducing the oversight 
and time burden on implement- 
ing partners. Interventions 
could range from messaging 
campaigns to promote social 
distancing to laws and regula- 
tions. Where full randomization 
(without phase-in) is possible, 
this may be desirable to in- 
crease statistical power (3). 

RCTs are, of course, not the 
only method for estimating the 
impact of NPIs. Where ran- 
domization is not feasible, the “natural ex- 
periments” created by some policies can be 
exploited, such as quasi-arbitrary cutoffs 
(for example, the reopening of stores below 
a certain square footage). Observational 
studies, often integrated with mathematical
models, have also contributed important
insights.

Great care must be exercised to make 
RCTs ethical. Several considerations are 
relevant: The approach may be ethically 
justifiable because there are two sources 
of uncertainty around most interventions. 
For any intervention, it may be uncertain 
whether its benefits in terms of reducing 
disease transmission exceed its economic 
and psychological costs or how these costs 
and benefits relate to those of other inter- 
ventions. At the same time, it is difficult 
to identify a single “correct” moment to
loosen or tighten protective measures, as
illustrated by ongoing policy debates. Thus,
equipoise may be satisfied in terms of costs,
benefits, and timing. Policy-makers are
therefore neither knowingly withholding a
beneficial intervention from constituents
nor knowingly imposing a harmful one.
This uncertainty is likely to make staggered
tightening or loosening of an intervention
more acceptable to the public.

Testing interventions during an outbreak
Time course of infection in the absence of an intervention (red) and with an
intervention (blue) that is either loosened (top) or tightened (middle) for 2 weeks.
The number of cases at the end of different tightening or loosening windows
(bounds indicated with dashed vertical lines) in regions where the intervention
was tightened or loosened compared with regions where it remained in place
provides a measure of the relative change in transmission associated with the
intervention. Final size (bottom) is affected by the time point at which the 2-week
tightening or loosening of the intervention is initiated (5). See supplementary
materials for details.

Further, the phase-in or stepped-wedge ap- 
proach may be ethically justifiable because 
individuals in both control and treatment 
groups eventually experience the costs and 
benefits of any intervention. In addition, 
even short periods of tightening or loosen- 
ing can be used to determine the impact of 
mitigation measures, minimizing the burden 
on whichever group experiences the smaller 
benefits. A powerful illustration of the ethi- 
cal acceptability of this phase-in approach 
among both scientists and the public is its 
use in RCTs of vaccines, even for highly lethal 
pathogens such as Ebola (4).

MODELS TO GUIDE DATA COLLECTION

Careful measurement of out- 
comes is crucial for this ap- 
proach to succeed. In particular, 
it is essential to understand the 
impact of any given intervention
on the full epidemic trajectory
[see supplementary materials (SM)].
However, the measurement requirements
can be simplified if data collection
and analysis are guided by
compartmental models from 
infectious disease epidemiol- 
ogy. The time course of infec- 
tions is affected in a SIR model 
(reflecting the three possible 
states of an individual in the 
community: susceptible, infec- 
tious, or recovered) when one 
group of locations (such as 
counties or districts) loosens or 
tightens the intervention for 2 
weeks while another maintains 
the status quo (see the figure) 
(5). Crucially, because the SIR 
model describes the entire tra- 
jectory of an outbreak using 
only two parameters, very few
measurements are required to 
estimate them. In particular, us- 
ing only estimates of the num- 
ber of infections at the end of 
the intervention in treatment 
and control regions, we can es- 
timate how much a given inter- 
vention reduces transmission 
relative to no intervention (see
SM). In addition, this difference
allows policy-makers to determine
which of several interventions
reduced transmission the most and by
how much. If additional information about 
the number of infections at the beginning of 
the intervention is available, we can further 
estimate whether transmission has been suf- 
ficiently reduced that the outbreak is shrink- 
ing (corresponding to an effective reproduc- 
tive number below 1). 
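
To make the measurement argument concrete, here is a minimal sketch, not the estimator from the supplementary materials. It assumes a basic SIR model in population fractions, an early-phase approximation in which the susceptible share barely changes over the 2-week window, and hypothetical parameter values. The function name `simulate_sir` and all constants are illustrative.

```python
import math

def simulate_sir(beta, gamma, s0, i0, days, dt=0.01):
    """Forward-Euler SIR in population fractions; returns (s, i) after `days`."""
    s, i = s0, i0
    for _ in range(int(round(days / dt))):
        new_infections = beta * s * i * dt
        recoveries = gamma * i * dt
        s -= new_infections
        i += new_infections - recoveries
    return s, i

# Hypothetical parameter values, chosen only for illustration.
GAMMA = 0.1           # recovery rate per day
BETA_KEPT = 0.15      # transmission rate with the intervention kept in place
BETA_LOOSENED = 0.30  # transmission rate where the intervention is loosened
S0, I0, WINDOW = 0.99, 1e-4, 14.0

_, i_control = simulate_sir(BETA_KEPT, GAMMA, S0, I0, WINDOW)
_, i_treated = simulate_sir(BETA_LOOSENED, GAMMA, S0, I0, WINDOW)

# Early on, I(t) ~ I0 * exp((beta * S - gamma) * t), so the log ratio of
# end-of-window infections isolates the difference in transmission rates.
beta_diff_hat = math.log(i_treated / i_control) / (WINDOW * S0)

# With a start-of-window count as well, the sign of the growth rate tells
# whether the effective reproductive number is below 1.
growth_control = math.log(i_control / I0) / WINDOW
outbreak_shrinking = growth_control < 0
```

Under these assumptions `beta_diff_hat` should land close to the true difference `BETA_LOOSENED - BETA_KEPT`, using only the two end-of-window infection counts. The approximation degrades as the susceptible share is depleted during the window.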

Insights from epidemiology can also be 
used to address several additional ques- 
tions: In addition to learning how much an 
intervention changes the transmission rate, 
policy-makers may also want to know how 
different interventions affect the “final size” 
of the pandemic—what share of the popula- 
tion will have been infected in total when 
the pandemic has died down. Also, they 
may want to understand how a single inter- 
vention might perform if it were deployed 
at different time points during the pan- 
demic (for example, early versus late) but 
can only test it once. Additionally, they may
want to directly compare the effectiveness
of two interventions, despite them having 
been deployed at different times, because it 
may not always be possible to time periods 
of tightening and loosening precisely. 

In the stylized model, all of these esti- 
mates can be derived from adding a single 
measurement at one time point to those 
described above—namely, the number of 
susceptibles [measured, for example, with 
serology (6)] (see SM). Of course, the avail- 
able capacity for polymerase chain reaction 
and serology needs to be able to support 
such studies, but testing capacity is grow- 
ing around the world, suggesting feasibility. 
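
As a rough illustration of why the number of susceptibles matters for "final size", the sketch below solves the standard SIR final-size relation. It assumes a constant transmission rate for the whole outbreak, which is simpler than the timed interventions discussed above, and the function name and default values are hypothetical.

```python
import math

def final_size(r0, s0=0.999, i0=0.001, tol=1e-10):
    """Total share infected by the end of an SIR outbreak.

    Solves s_inf = s0 * exp(-r0 * (s0 + i0 - s_inf)) by bisection and
    returns s0 + i0 - s_inf. Here r0 is the ratio of the transmission
    rate to the recovery rate, and s0, i0 are the starting shares.
    """
    def f(x):
        return x - s0 * math.exp(-r0 * (s0 + i0 - x))

    lo, hi = 0.0, s0  # f(lo) < 0 < f(hi) whenever i0 > 0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if f(mid) < 0:
            lo = mid
        else:
            hi = mid
    return s0 + i0 - 0.5 * (lo + hi)

# Hypothetical comparison: the same outbreak with and without an
# intervention that lowers the transmission ratio from 2.0 to 1.5.
baseline = final_size(2.0)
with_intervention = final_size(1.5)
```

Because the relation depends on the susceptible share at the starting point, a single serological estimate of susceptibles, combined with an estimated transmission rate, is enough in this stylized setting to project final size from any point in the outbreak.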

An important caveat to reducing the 
measurement requirements is that the 
above approach leans on the assumptions 
of a relatively simple SIR model; in par- 
ticular, both the transmission rate and the 
impact of an intervention on this rate are 
assumed to remain constant throughout 
the pandemic. However, it is straightfor- 
ward to extend the model to accommodate 
inherent variation in transmission through 
time, and complex treatment effects that 
may be a signature of NPIs, including de- 
cay over time (for example, fatigue from a 
lockdown or fading response to a messag- 
ing campaign), persistence (for example, 
hygiene behaviors such as handwashing
that turn into habits), or intensification
over time (such as messaging campaigns 
that “go viral”). In such cases, the mea- 
surement requirements will increase to 
identify the additional parameters (such as 
decay) contained in the extended model. 
Similarly, the basic compartmental model 
can be extended [to SIRS (susceptible-in- 
fected-recovered-susceptible), SEIR (sus- 
ceptible-exposed-infectious-recovered), 
or age-structured models, for example] to 
reflect additional features of the transmis- 
sion process (duration of immunity, latent 
period, or variable contact patterns over 
age) or the intervention (for example, if it 
targets specific age groups). 

Thus, the effects of interventions on dis- 
ease transmission can be estimated with the 
help of epidemiological models. However, 
the economic and psychological costs and 
benefits of such interventions are equally 
important. Reducing the number of mea- 
surements by leveraging the SIR model 
is not possible for these outcomes, about 
which the model makes no predictions and 
whose time course need not follow that 
of infections. For example, a “successful” 
intervention that reduces the risk of over- 
burdening the health system will have the 
effect of spreading the infections over time. 
This implies that the desirable behaviors
induced by any intervention have to be
maintained for longer to outlast the duration of
the pandemic. This may impose psychological
and economic costs on the population
that are larger than those that would be 
incurred in a more temporally condensed 
pandemic. In the absence of a model, these 
effects can only be captured with careful 
measurement over time. 


SPILLOVER EFFECTS 

Interventions delivered to some regions 
or individuals but not others are likely to 
nevertheless affect those who were not tar- 
geted. Such so-called “spillover” effects pre- 
sent both a challenge and an opportunity in 
evaluating the impact of NPIs. The oppor- 
tunity is that such spillovers can generate 
strongly increasing returns to intervention 
coverage in terms of individual protection 
(7); they can therefore be harnessed to max- 
imize the effects of a given intervention. For 
example, consider a hypothetical interven- 
tion that reduces the size of a pandemic by 
15% when it is delivered to 20% of a com- 
munity. Because of the nonlinear dynamics 
of infection that arise from depletion of the 
number of those susceptible to infection, in- 
creasing the coverage to 60% may generate 
a greater-than-proportional reduction in 
pandemic size of 56%. 
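
The nonlinearity can be illustrated with a toy calculation. It does not reproduce the 15% and 56% figures above, which come from the authors' own example. The sketch assumes a well-mixed population in which coverage scales the transmission ratio linearly, and the values of `R0` and `EFFICACY` are hypothetical.

```python
import math

def attack_rate(r, iterations=500):
    """Solution of z = 1 - exp(-r * z), the SIR final-size relation.

    For r > 1 the iteration converges to the positive root.
    """
    z = 1.0
    for _ in range(iterations):
        z = 1.0 - math.exp(-r * z)
    return z

R0 = 2.0        # hypothetical basic reproductive number
EFFICACY = 0.5  # hypothetical: full coverage would halve transmission

def size_reduction(coverage):
    """Relative reduction in final size when a share `coverage` is reached."""
    baseline = attack_rate(R0)
    treated = attack_rate(R0 * (1.0 - EFFICACY * coverage))
    return 1.0 - treated / baseline

low, high = size_reduction(0.2), size_reduction(0.6)
```

In this toy setting, tripling coverage from 20% to 60% yields more than three times the reduction in final size, which is the kind of increasing return that spillovers can generate.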

At the same time, such spillovers pose a 
challenge to the estimation of treatment
effects. However, standard trial designs are
available to enable measurement of
spillovers (8-10). In particular, nonlinear returns
to saturation (the share of the population ex- 
posed to an intervention) can be integrated 
into tests of interventions by creating varia- 
tion in spatial saturation of intervention 
delivery. For example, groups of 15 locations 
might be randomized to a “low saturation” 
condition in which a third of locations are 
treated with an intervention—for exam- 
ple, the distribution of face masks or hand 
sanitizer, or opening or closing of parks or 
schools—or to a “high saturation” condition, 
in which two-thirds of locations are treated. 
Such studies have to be relatively large scale 
to achieve adequate statistical power; power 
calculations are therefore important, and us- 
ing more than two or three levels of satura- 
tion may not be practicable. 
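
A two-level randomized saturation design of this kind could be assigned as in the sketch below. The group and location names are hypothetical, and the function `randomized_saturation` is an illustration rather than a procedure taken from the cited trial designs.

```python
import random

def randomized_saturation(groups, saturations=(1 / 3, 2 / 3), seed=0):
    """Two-stage randomization for a saturation design.

    Stage 1: shuffle the groups and alternate saturation levels so that
    each level receives a balanced number of groups.
    Stage 2: within each group, treat a random subset of locations whose
    size matches the assigned saturation.
    """
    rng = random.Random(seed)
    group_ids = list(groups)
    rng.shuffle(group_ids)
    design = {}
    for idx, gid in enumerate(group_ids):
        saturation = saturations[idx % len(saturations)]
        locations = list(groups[gid])
        n_treated = round(saturation * len(locations))
        treated = set(rng.sample(locations, n_treated))
        design[gid] = {
            "saturation": saturation,
            "treated": treated,
            "control": set(locations) - treated,
        }
    return design

# Hypothetical example: 4 groups of 15 locations each.
groups = {f"group_{g}": [f"g{g}_loc{k}" for k in range(15)] for g in range(4)}
design = randomized_saturation(groups, seed=1)
```

Untreated locations in high-saturation groups can then be compared with untreated locations in low-saturation groups to estimate spillovers, while treated and untreated locations within a group give the direct effect.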

Because spatial spillovers may occur at 
different spatial scales, causal inference 
methods that flexibly allow for such compli- 
cations have to be used. Data on the source 
of spillovers, such as the commuting pat- 
terns of essential workers, can help iden- 
tify relevant spatial scales. The feasibility 
of this approach in terms of both statistical 
power and causal inference in the presence 
of spillovers of unknown spatial dimensions 
has been suggested by recent large-scale 
studies on the general equilibrium effects of 
economic interventions (11). Thus, tests of
interventions to combat COVID-19 should
take advantage of, and measure, these non- 
linear effects of saturation. 

NPIs can be rigorously tested by using 
randomization without compromising 
scientific and ethical standards. Although 
this approach will require more time than 
generating projections from observational 
methods and mathematical models, the 
benefits in terms of accuracy could be con- 
siderable. If policy-makers and scientists 
combine insights from infectious disease 
epidemiology with carefully and ethically 
designed impact evaluation, alongside 
other empirical and theoretical methods 
for studying impact (12-14), they will have 
a powerful tool for reducing the human 
health, societal, and economic costs in the 
SARS-CoV-2 pandemic and in pandemics 
in general.

SCIENCE AND REGULATION


Filling gaps in science exposes 
gaps in chemical regulation 


Examination of U.S. and EU regulatory systems raises more 


questions than answers 


By Steve C. Gold and Wendy E. Wagner


The regulation of chemicals should
protect public health and the environment
from undue risk of harm,
should promote the development
and use of safer alternatives to more
hazardous chemicals, and should provide
the public with sufficient information
to understand how well chemical risks are 
being managed. How well are these goals 
being achieved? The regulatory system in 
the United States has been derided as
dysfunctional (1), even with major amendments
enacted in 2016 (2) that some supposed 
would bring the U.S. program closer to the 
European Union’s REACH (Registration, 
Evaluation, Authorisation and Restriction of 
Chemicals) Regulation. To build on the lit- 
erature that documents the shortcomings of 
chemical regulation (1, 2), we take as a
convenient example the compounds described
in new research by Washington et al. on 
page 1103 (3). Washington et al. report the 
unexpected presence in environmental samples
of chloroperfluoropolyether carboxylate
compounds (ClPFPECAs), apparently used
as a substitute for other perfluoroalkyl sub- 
stances (PFASs) that had raised environmen- 
tal concerns (3). Attempting to trace these 
compounds through the regulatory regime 
raises more questions than answers, reveal- 
ing the structural limits of existing regula- 
tion. These limits apply not only to this par- 
ticular case but to myriad chemicals. How 
much confidence do regulatory systems give 
the public that substitute chemicals are safer 
than the substances they replace? Not nearly 
as much as one would like. 


BIRTH OF A NEW CHEMICAL 

The compounds found by Washington et 
al. apparently were first reported in the lit- 
erature as part of the search for “environ- 
mentally friendly” replacements for chloro- 
fluorocarbons and hydrofluorocarbons (4). 
They appear to have found a use, however, 
in the manufacture of fluorinated polymers
with nonstick properties for use, for example,
in cookware (5, 6). As such, they helped
answer an urgent desire to eliminate the 
use and release of perfluorooctanoic acid 
(PFOA) and its precursors and homologs 
(3), which had been widely dispersed in the 
environment as a result of the production 
of other substances with similar properties 
[see supplementary materials (SM)]. Amid 
growing evidence that PFOA and similar 
compounds were highly persistent, bioac- 
cumulative, and potentially toxic, as well as 
a spate of private lawsuits and increasing 
state and federal government scrutiny (see 
SM), eight companies voluntarily agreed “to 
work toward eliminating PFOA from emis- 
sions and in product content” in the United 
States by 2015 (7). By the time of this agree- 
ment between the companies and the U.S. 
Environmental Protection Agency (EPA), 
the phase-out of PFOA was already well 
under way informally (see SM). EPA’s PFOA 
Stewardship Program required the compa- 
nies to submit annual reports of their prog- 
ress, although, as a voluntary agreement, it 
had no enforcement mechanism (7). And be- 
cause the program focused only on reducing 
and eventually eliminating PFOA emissions 
and PFOA in product content, it did not in 
any way address any substitute compounds 
the companies might develop, use, gener- 
ate, or release in lieu of PFOA. Toxicity test- 
ing and reporting for substitute compounds 
were simply outside the scope of the PFOA 
Stewardship Program. 

Toxicity testing has revealed reasons for 
concern about some of these alternatives (8, 
9), leading regulatory authorities in at least 
one U.S. state, New Jersey, to search for re- 
placement as well as legacy PFASs in the en- 
vironment (3, 9). The report by Washington 
et al.—whose authors are researchers af- 
filiated with EPA and the New Jersey 
Department of Environmental Protection 
(DEP)—grew out of that effort. Given the na- 
ture of the PFOA Stewardship Program, as 
well as other features of U.S. chemical regu- 
lation that we describe below, it makes sense 
that the CIPFPECAs the investigators found 
were previously unknown to them. 

Washington et al. detected CIPFPECAs 
in every soil sample they tested from New
Jersey, as well as in a stored soil sample
taken during earlier research at a location 
more than 400 km away (3). They concluded 
that their data “strongly suggest atmo- 
spheric release” of these compounds from a 
New Jersey facility of a Solvay S.A. business 
unit (3). They also found these compounds 
in a previously taken water sample from a 
river in Italy, which they used to help con- 
firm their identification of the compounds 
(3). The water sample result agreed with 
prior findings by other researchers who 
found CIPFPECAs in the same river (6). In 
light of the fairly widespread detection of 
these compounds in environmental sam- 
ples, it is reasonable that the researchers 
suggest that further investigation of these 
compounds’ environmental fate, transport, 
and degradation is warranted, and that 
“investigation of whether these CIPFPECAs 
might be toxic is prudent” (3). 


PUBLIC TOXICITY INFORMATION 

But what can we say right now, based 
on publicly available information, about 
“whether these CIPFPECAs might be toxic”? 
One might think that government regula- 
tors could provide some answers for the 
public. As it turns out, however, trying to 
trace these compounds through the U.S. 
and European regulatory systems yields 
frustratingly unsatisfying answers and re- 
veals a dearth of publicly available research 
to support such answers as may exist. 

In the United States, regulators in 
California have expressed serious concerns 
that perfluoroether carboxylic acids (a
class that includes the ClPFPECAs; see
SM) “may have similar or higher toxic
potency than the longer-chain PFAAs
[perfluoroalkyl acids] they are replacing” [(10), p.
38] and are similarly “recalcitrant to
degradation and extremely persistent in the
environment” [(10), p. 10]. The agency also
stated that perfluoropolyethers, in general, 
may contain as impurities, or upon com- 
bustion may release, PFAAs that are persis- 
tent, bioaccumulative, and potentially toxic 
[(10), p. 12]. Yet the document expressing
these concerns focused primarily on GenX, 
another chemical in the class, and did not 
discuss the properties or potential toxicity
of ClPFPECAs in particular (10). GenX
and other PFAS chemicals have also been 
detected in environmental media at levels 
that raise regulatory concern (9, 11). 

As for Europe, a search by Chemical 
Abstracts Service (CAS) number 329238- 
24-6 reveals that the European Chemicals 
Agency (ECHA) requires classification and 
labeling of these compounds under the 
European Classification, Labelling and 
Packaging (CLP) Regulation. On ECHA’s 
chemical home page, the ClPFPECAs trigger
five of ECHA’s hazard codes: The substance
is fatal if swallowed, is fatal in contact with 
skin (i.e., acutely toxic), causes severe skin 
burns and eye damage, causes liver damage 
through prolonged or repeated exposure, 
and is toxic to aquatic life with long-last- 
ing effects. For 12 other hazard categories, 
the entries simply read “data lacking.” No 
research to support any of these classi- 
fications is cited on or publicly available 
through the ECHA website. 

On the other hand, in 2010 the European 
Food Safety Authority (EFSA) approved 
CIPFPECAs for use in the polymerization 
of anti-stick coatings for repeated-use food- 
ware, subject to restrictions on process quan- 
tities and temperatures (5). EFSA found the 
chemical safe for these ap- 
plications. EFSA concluded 
that the substance passed a 
bacterial gene mutation test, 
an in vitro mammalian cell 
gene mutation test, and an in 
vitro mammalian cell chro- 
mosome aberration test. The 
sole source cited was a “dos- 
sier” provided by the manu- 
facturer, without any studies 
that the dossier may have 
included or cited (5). In response
to our request for further
information under the [...]
the studies cannot be found
in the public domain (see 
SM). 

Thus, in our search for 
toxicity information, we 
found more questions than answers. There 
are dire but generalized concerns of the 
state of California; five hazard classifica- 
tions listed by ECHA, with “data lacking” 
for 12 others; and a benign assessment by 
EFSA, based on limited and unpublished 
data, with respect to polymerization of 
foodware coatings. We also checked an 
enormous chemical-toxicological database 
maintained by EPA (Comptox), which in- 
cludes information on some 875,000 chemi- 
cal substances (far more than the universe 
of regulated substances). We found an
entry for the ClPFPECAs that includes an
“executive summary” of the available data
in 32 categories of information relevant to
toxicity or hazard; no data or values are
shown for any of the 32 categories (see SM).
Looking beyond government agency websites,
we also found no toxicological studies
of ClPFPECAs in a literature search.

Where are the data?
The EPA Comptox chemical-toxicological
database includes an entry for the ClPFPECAs,
but indicates that no data or
values are available for any of
the categories of information.


THE REGULATORY STORY 

This scientific inconclusiveness leads to the 
next obvious set of questions: How could 
CIPFPECAs appear to be so unstudied and 
unaccounted for? After all, CIPFPECAs 
share at least some properties with PFASs 
that are being phased out (6, 9, 10). Society 
would presumably want to know that substi- 
tute chemicals like these are not worse than 
the chemicals they replace. 
Shifting focus to the regula- 
tory system, however, yields 
few definitive answers about 
the oversight of CIPFPECAs, 
although it does offer sev- 
eral lessons about the design 
of our regulatory programs 
more generally. 

The first lesson, just 
touched on above, is that 
while regulatory attention 
is focused on eliminating 
high-profile chemical risks, 
less effort appears to be 
dedicated to analyzing the 
safety of substitute chemicals
used to replace them (12,
13). A number of scientists
have raised general concerns 
about the need for rigorous 
comparative assessments 
of replacement chemicals, 
particularly within the PFAS 
family (12, 13). This type of 
comparative analysis seems 
particularly appropriate in 
light of the voluntary phase- 
out of PFOA, all the more so 
because EPA has identified 
about 500 PFAS chemicals 
sold in U.S. commerce, out 
of a larger, global list of thou- 
sands of such compounds (see SM). However, 
assessments for PFAS chemicals appear to 
have been conducted—at best—on an ad 
hoc basis and primarily through negotiated 
agreements. The resulting, publicly available 
research on PFAS chemicals is quite limited. 
The state of New Jersey reports that out of a 
list of 900 PFAS chemicals, only 200 chemi- 
cals have toxicity data available at all, and 
even that research is incomplete (9). EPA has 
now instituted more uniform comparative 
assessment procedures for some PFAS chem- 
icals, but these recently revised procedures 
apply only to new polyfluoroalkyl chemicals
or uses produced after 2015 (see SM). It is of
course possible that despite the regulatory 
vacuum, a comparative analysis was none- 
theless performed internally by the manufac- 
turer. If that analysis exists, however, it does 
not appear to be publicly available. 

What about oversight of CIPFPECAs un- 
der the U.S. Toxic Substances Control Act 
(TSCA)? TSCA is a regulatory program origi- 
nally enacted in 1976 to prevent unreason- 
able risks caused by chemicals. Under TSCA, 
makers of “new” chemicals (developed after 
1976) must submit a premanufacture notification
to EPA (1). And the ClPFPECAs certainly
seem, on their face, to represent a
distinctly “new chemical” developed after 1976
(4). A search through EPA’s TSCA inventory, 
however, provides more unsolved mysteries: 
The CIPFPECA family is not listed (by CAS 
number) in EPA’s public inventory of more 
than 40,000 registered chemicals (see SM). 

It is theoretically possible that the manu- 
facturer simply violated EPA’s registration 
requirements under TSCA, but there are 
several more likely explanations for why the 
CIPFPECAs are not listed in EPA’s inventory. 
One is that although the law generally re- 
quires premanufacture notification of “new” 
chemicals, there are multiple exemptions 
from this registration requirement. It is un- 
clear whether the CIPFPECAs satisfied any 
of these exemptions, which allow manufac- 
turers to avoid submitting a premanufacture 
notification, for example, on new chemicals 
that are long-chain polymers or that are only 
impurities. But we cannot know for sure, be- 
cause EPA generally does not provide public 
tracking of the manufacturers’ use of these 
various exemptions (see SM). 

Alternatively, it is possible that the 
CIPFPECAs are in fact tracked under TSCA, 
but the chemical is not listed in the pub- 
lic database because the chemical struc- 
ture was classified by the manufacturer 
as a protected trade secret (called CBI, for 
“confidential business information”) and 
removed from public view. Currently, more 
than 140 unidentified PFAS chemicals in 
U.S. commerce are classified as CBI, ac- 
cording to EPA (see SM). Once information 
is stamped by a manufacturer as CBI, only 
cleared government staff can view the files 
(14). Even the generic names of the CBI 
chemicals are not made public (14). Yet a 
CBI classification seems inapplicable to the 
ClPFPECAs because their chemical structure
has been published (4). In practice, though, 
a CBI claim remains legally in place until 
either the manufacturer or EPA officially 
“declassifies” the claim (14). Because more
than 10,000 chemicals in the TSCA inventory
are classified as CBI (14), Congress in
2016 required EPA to review and, if warranted,
declassify a subset of CBI chemicals.
The agency is still working on this assignment,
with about 2000 chemicals left
to review (see SM). CIPFPECAs may well 
be in this queue, but confirming whether 
they are is not easy; it likely requires a for- 
mal request under the federal Freedom of 
Information Act. We can therefore extract a 
second lesson about the regulatory system: 
Some chemicals may fall through the cracks 
in the public tracking system in the United 
States, not because they are adequately as- 
sessed for toxicity but for other reasons. 
Yet just because CIPFPECAs may not 
be in public view does not mean that the 
chemicals are out of range of U.S. regula- 
tors. Under TSCA, it is possible that even if 
the product is protected as a trade secret, 
EPA is actively overseeing the CIPFPECAs. 
Under this “extensive regulation” scenario, 
EPA could ask for more testing or other 
negotiated restrictions as a condition to 
approving CIPFPECAs under TSCA. Or 
the agency may be satisfied—after con- 
ducting its own analysis—that the risks 
of CIPFPECAs are not unreasonable. EPA 
might even impose new restrictions on the 
manufacture of CIPFPECAs over time if it 
learns of new “adverse effects,” which manufacturers
are required by law to report.
Conversely, even if CIPFPECAs were sub- 
ject to premanufacture regulatory review, 
it is also possible that EPA decided to pass 
CIPFPECAs into commerce without much, 
or any, testing or analysis. Only about 15% of 
premanufacture notifications for new chemi- 
cals submitted to EPA include any health or 
safety test data at all [(15), p. 11], and EPA’s
own statistics show that only 10% of the new 
chemicals entering commerce between 1979 
and 2016 involved restrictions or testing or- 
ders (see SM). In the case of the CIPFPECAs, 
then, EPA may have been concerned that 
there was insufficient scientific evidence 
available at the time to support an order de- 
manding more testing. Under the pre-2016 
law, EPA was required, within a brief 90-day 
period, to produce some evidence of poten- 
tial risk as a predicate to ordering additional 
toxicity tests from the manufacturers. This 
catch-22 resulted in a paucity of testing
orders (1, 2). [Congress removed this legal
impediment in the 2016 TSCA amendments, but
that came too late for the CIPFPECAs (2).] 
We thus don’t know much, if any- 
thing, about the regulatory oversight of 
CIPFPECAs in the United States. This mys- 
tery provides yet a third lesson about chem- 
ical regulation: For the 40,000-plus chemi- 
cals in commerce, the burden of chemical 
assessment rests almost entirely on a small 
group of EPA regulators (1). As a result,
some, perhaps many, chemicals likely fall 
through the cracks. Indeed, despite the 
amendments to TSCA in 2016, chemical
manufacturers are still not required to
anticipatorily test or assess their chemicals
as a condition to marketing in the United
States (1, 2). Instead, it is the regulators
who bear responsibility for identifying the 
most hazardous chemicals, identifying the 
relevant scientific literature bearing on tox- 
icity, ordering new tests, and synthesizing 
and analyzing the available information 
bearing on the safety of individual chemi- 
cals (not to mention determining proper 
policy responses) (1, 2). Manufacturers’ primary
role is to serve as respondents. In this
role, they not only provide the information 
that EPA requests, but also have the right 
to lodge extensive and critical comments on 
the agency’s work. Manufacturers can also 
sue the agency in court, arguing that some 
aspect of the agency’s analysis might be ar- 
bitrary (see SM). And the new powers EPA 
gained from the 2016 amendments—such as 
the ability to require additional testing and 
prioritize chemicals of concern—came with 
substantial procedural impediments (2). 

What about Europe? CIPFPECAs are of- 
ficially registered in two EU regulatory pro- 
grams. As mentioned above, EFSA approved 
CIPFPECAs in the manufacture of nonstick 
coating products (5). And CIPFPECAs are 
listed—along with five hazard classifica- 
tions—in the EU’s notification (CLP) data- 
base (see SM). In neither case, however, is 
the supporting research behind these regula- 
tory findings readily available to the public. 

Additionally, in 2005, the European Union 
implemented REACH, a more aggressive 
chemical regulatory program than the U.S. 
TSCA program. Unlike TSCA, REACH re- 
quires manufacturers to conduct a compre- 
hensive literature search on the toxicity of 
each of their chemicals sold above threshold 
quantities in the European Union. If the ex- 
isting scientific information proves incom- 
plete under REACH standards, manufactur- 
ers are also required to conduct additional 
toxicity testing (1). Yet a search by CAS
number returns no results for CIPFPECAs 
on the European Chemical Agency’s web- 
site that lists registered REACH chemicals. 
Perhaps CIPFPECAs are produced in low 
enough quantities (less than 1 tonne/year) 
to be exempted from REACH, or perhaps 
CIPFPECAs satisfy other REACH exemp- 
tions, such as those governing impurities or 
polymers as defined under REACH. Publicly 
available information again does not re- 
solve this question (see SM). 


A LONG WAY TO GO

Public health and environmental concerns 
led to the phase-out of PFOA and its clos- 
est chemical relatives. It seems obvious 
that in such a scenario, society should 
want to be reasonably certain that the
replacement chemical “cure” is not worse
than the phased-out chemical “disease.” Yet 
the researchers from EPA and New Jersey 
DEP found substitute polyfluorinated com- 
pounds, CIPFPECAs, in their environmental 
samples (3). At present, there is little in the 
public scientific record to indicate whether 
the environmental dissemination of these 
substitute PFASs is benign or harmful, and if 
harmful, how harmful. Our examination of 
the U.S. and European regulatory programs 
raises more questions than answers about 
the extent to which CIPFPECAs are being 
tracked, studied, and regulated. Certainly 
the European Union has made more prog- 
ress than has the United States in this re- 
gard (5). Still, the toxicological mysteries of 
CIPFPECAs—and thousands of other poten- 
tially toxic chemicals that are regulated (or 
perhaps not regulated) in ways that remain 
effectively inscrutable—suggest that we 
have a long way to go in designing effective 
and accountable chemical regulation,
particularly in the United States.

Colonialism and its consequences


A ravaged Arctic ecosystem serves as a warning 
of the perils of human advancement 


By Mary Ellen Hannibal 


The area known as Beringia sits atop
the world, straddling Asia and North
America. A mere 50 miles of water
separate the Chukchi Peninsula in
Russia and the Seward Peninsula in
the United States. Terrestrial plant
life is relatively impoverished here, but the 
Bering Strait compensates for the land’s 
parsimony and more. 

In Floating Coast, Bathsheba Demuth tells 
the story of this singular place beginning in 
the mid-1800s. Empire-making was under 
way then, both by the United States and by 
Russia, and the lifeways of native Inupiat, 
Yupik, and Chukchi peoples were about to
be severely curtailed. So too were the natural 
histories of the species that contributed to 
making the region. 

America was earnestly building its sov- 
ereign might, and thus its economy, during 
this period. Having exhausted species off 
both the Atlantic and Pacific coasts, whal- 
ers looking for new quarry came to Beringia 
to hunt bowheads. Whales were integral to 
just about every machine and product made 
at the time. Whale blubber was used for lu- 
bricating sewing machines and cotton gins. 
Whalebone (baleen) lent structure to um- 
brellas, fishing rods, and mattresses. Whale 
tallow was refined into soap and became a 
base for perfume. Most of all, whale oil was 
used to fuel indoor lighting.

The ever-ratcheting appetite for whales
drove them to near extinction in a few 
short decades. The indigenous peoples 
whose lives depended on whales lost both 
their source of sustenance and the culture 
around which their identities had been con- 
structed for millennia. 

With whale populations depleted, indus- 
trial appetites turned to walruses, although 
it took 250 of the blubbery beasts to measure 
up to a single bowhead. “Far from Beringia,” 
Demuth writes, “walruses were a minuscule 
part of imagining a prosperous, 
mechanized future: as fan belts on 
power looms or grease in factory 
cogs....” Wholesale depletion of
walruses continued well into the 
early 1900s, as their blubber was 
transformed into nitroglycerin in 
service of World War I. 

While Americans denuded 
Arctic waters, Russians bristled 
at what they considered theft of 
their natural resources. The den- 
izens of the waters between the 
two sovereign nations did not 
adhere to any putative human boundary. 

“The problem of whaling, for the United 
States and Imperial Russia, was not so much 
that it killed too many whales, but that it 
brought commerce while failing at civilization,”
writes Demuth. The United States
bemoaned profits that were being spent on 
guns and alcohol, contributing not to lawful- 
ness but its opposite. The Russians saw those 
profits as going to the wrong people (Ameri- 
cans). The solution that was negotiated was to 
establish boundaries, to “enclose” the cyclical
peregrinations of whales, and subsequently
those of walruses, reindeer, and fox, in an attempt
to systematize harvests and make their
yields more reliable. Bringing little ecological
knowledge to bear on this effort led to both
market and population crashes: “...while dollar
value might make fur seem the same as
blubber or ivory, the rhythms that governed
their creation were not equivalent; the market’s
habit of supplanting one desire for another
did not apprehend how a fox would
never live on the same time as a walrus.”

A ship traverses the Bering Sea near Russia’s Chukchi Peninsula.

As the Russian Revolution transformed 
into bolshevism, and from thence to com- 
munism, the question of how exactly to gov- 
ern territory according to a Marxist ideal 
brought new pressures to Beringian land, 
sea, and native peoples. The cycles of nature 
notwithstanding, Stalinism in particular 
sought to mechanize production accord- 
ing to its own time frame, not only at the 
expense of the animals involved 
but by way of cruelty to the Rus- 
sian people it employed. 

Once nonliving resources in 
the form of gold and eventually 
tin and oil became the focus of 
colonial harvest, the question of 
how to enclose Beringia became 
expressly about sovereignty. “In 
Alaska and Russia at the turn of 
the twentieth century, the politics 
of mining had to do with who 
should rightfully benefit from 
gold: the individual proprietor, 
the corporation, the tsar, or the collective.” 

In Demuth’s masterful narrative, the 
very long time frame of energy production 
crashes disastrously into the ever more ex- 
pedient ways humanity purposes that en- 
ergy. Global warming, the result of quickly 
burning organic material that took eons to 
accumulate, is the apotheosis of colonial 
strategy. Floating Coast is eloquent testimony
to how this strategy is not working.

Where logic meets emotion


A computer scientist gets candid about her quest to bring 
empathy to artificial intelligence 


By Vijaysree Venkatraman 


As she sat in a taxi headed to Cairo
International Airport in September
2001, Rana el Kaliouby remembers
thinking, “Am I really going through
with this?” A married woman and
hijab-wearing Muslim, she would
be on her own for the next 3 years, pursuing
her doctorate in computer science at
the University of Cambridge in the United
Kingdom. In Girl Decoded, el Kaliouby and
coauthor Carol Colman have created a riveting
memoir of a “nice Egyptian girl” who,
despite cultural conditioning that encouraged
her to put her duties as a wife and
mother first, went on to pursue her professional
dreams. She would become a pioneer
in the emerging field of artificial emotional
intelligence (emotion AI), where researchers
seek to build computers that can sense
and respond to human emotions.

El Kaliouby was inspired by the influential
1997 book Affective Computing by Rosalind
Picard. Drawing on findings from
neuroscience, Picard’s central argument
was that if we want smarter computers that
interact more naturally with us, we must
endow machines with the ability to recognize,
understand, and even express emotions.
This idea ran counter to conventional
wisdom, which posited that pure logic was
the highest form of AI.

In her first presentation at Cambridge,
el Kaliouby announced that she planned to
teach computers to read facial expressions,
a proxy for a person’s mental state. At the
time, computers could barely tell apart a
human face and a piece of fruit. The project
was audacious.

A fellow student remarked that his brother,
who was autistic, had great trouble understanding
facial expressions. (Many people
on the autism spectrum struggle to interpret
nonverbal cues that indicate a person’s mood
or emotions.) Perhaps her work could ultimately
help autistic people, he suggested.

El Kaliouby was intrigued and approached
Simon Baron-Cohen, a prominent
autism researcher on campus, to learn more.
Baron-Cohen, it turned out, was building an
exhaustive catalog of facial expressions to
serve as a guide for those on the spectrum,
and he allowed el Kaliouby to use the database
to train her algorithm. Her face-reading
code, “the Mind Reader,” became a reality in
2004. “If I could have done an Egyptian zaghroota,
the ululation of joy my people make
at weddings, I would have,” she confides.

Rana el Kaliouby demonstrates Affectiva’s facial expression recognition technology in 2018.

El Kaliouby writes with candor about the
challenges she faced during her training. 
She was homesick, and Cambridge’s cold 
weather added to her misery. “I was lonely 
and my work was just creeping along,” 
she admits. She barely saw her husband, 
who had founded a software company in 
Egypt, and their friends joked that she got 
pregnant with the couple’s first child “over 
the Internet.” 

In 2004, el Kaliouby had the chance to 
show her work to the woman who had 
originally inspired her to study emotion AI. 
Rosalind Picard, who was visiting the Cam- 
bridge Computer Laboratory, was extremely 
impressed with el Kaliouby’s work and of- 
fered her a postdoc position in the Affective 
Computing group at the Massachusetts Institute
of Technology in Cambridge, Massachusetts.

El Kaliouby’s reminiscences offer 
glimpses into her background and cultural 
upbringing. We learn, for example, that 
Arab men are conventionally referred to 
as the fathers of their eldest sons; for in- 
stance, “Abu Mohammed” means “father of 
Mohammed.” If a man has no sons, then he 
is typically called by his own first name. El 
Kaliouby recounts how, once, when she was 
visiting her father’s office in Abu Dhabi, a 
male colleague referred to him as “Abu 
Rana.” She realized that her father must 
have spoken of her at work with pride, and 
by using this naming convention, his col- 
league was acknowledging the young wom- 
an’s achievements. “I was deeply touched by 
the gesture,” she writes. 

In 2009, el Kaliouby and Picard founded 
the startup Affectiva to bring the emotion- 
sensing technology they had developed to- 
gether in the laboratory to the marketplace. 
Many industries now put the company’s 
software to myriad uses: from social ro- 
botics and market research to monitoring 
systems that check for distraction or drows- 
iness in drivers. In 2018, el Kaliouby was 
named to Fortune magazine’s “40 Under 40” 
list of the most influential young people in 
the world of business. 

Girl Decoded is an affecting memoir that 
highlights the tension between one wom- 
an’s upbringing and her aspirations. Such 
life stories, told well, have the power to
uplift us all.

A new national park could give China's depleted Hainan gibbon (Nomascus hainanus) population a

Recovery hopes for the
world’s rarest primate 


China was home to the world’s second- 
highest diversity of gibbons in the 20th 
century, but these species have all suffered 
extensive population declines, and regional 
species losses continue (1, 2). The Hainan
gibbon (Nomascus hainanus), formerly 
widespread across Hainan Island (3), is 
now the rarest primate and among the rar- 
est mammals (4). It represents one of the 
oldest gibbon lineages (5) but has received 
limited conservation attention compared 
with many other species in China. Hunting 
and habitat loss have reduced the spe- 
cies to a single high-elevation population 
in Bawangling National Nature Reserve 
(4, 6). Restricted to 15 km² of fragmented
rain forest within this reserve, the Hainan 
gibbon has persisted with fewer than 30 
individuals for several decades (3, 6, 7). 
With China’s plan to establish a national 
park on Hainan, the species has a renewed 
chance at recovery, but challenges remain. 
Although the Hainan gibbon population 
received inconsistent financial support for 
protective measures in the 1980s and 1990s 
(6), conservation efforts since then have 
yielded promising signs. Two newly formed 
social groups were discovered in 2015 and 
2019, and based on these positive trends, 
the total population size now likely exceeds 
30 individuals (8). However, the species 
has extremely low genetic diversity (9), and 
forest degradation and conversion present 
barriers to the range expansion necessary
for recovery (7). Increased enforcement has
eliminated hunting as a threat (6, 7), but 
the tiny population is inherently vulner- 
able to stochastic events such as emerging 
zoonotic disease outbreaks and extreme 
climatic events. 

In 2015, the National Park system was 
established as China’s new protected area 
framework (10). Hainan Tropical Rainforest
National Park will be established in 2020 
(11) and will cover almost one-seventh of 
Hainan’s land area. The Hainan gibbon, 
as Hainan’s endemic primate (2), has been 
designated the national park’s flagship 
species (12). Raising the species’ profile is 
an important step, but crucial data gaps 
remain, and National Park management 
must be evidence-based and guided by new 
research. In addition to increasing funding, 
it is essential to improve monitoring with 
innovative technologies, to understand 
habitat structure and resource distribution, 
and to implement science-led forest res- 
toration. In situ conservation remains the 
priority, but feasibility of intensive recovery 
strategies under potential emergency sce- 
narios must be appraised (7). Assessment 
of future climate change impacts is also 
urgently needed. Effective Hainan gibbon 
conservation and long-term protection of 
China’s tropical biodiversity require robust 
scientific understanding, including insights 
from the recovery of other highly
threatened species.

Cetaceans under threat
in South China Sea 


According to historical whaling and 
stranding records (1), the South China Sea
is home to more than one-third of extant 
cetacean species on Earth, all of which are 
listed in Appendix I or II of the Convention 
on International Trade in Endangered 
Species of Wild Fauna and Flora (CITES) 
(2). Recent data gleaned from the
ecological knowledge of local fishers (3)
and field surveys (4) add to the evidence 
that part or all of the South China Sea is an 
important cetacean area with high species 
diversity that deserves special conservation 
attention but has been previously over- 
looked (5). Overfishing, together with other 
anthropogenic activities in the South China 
Sea, is putting cetaceans in grave danger. 
The South China Sea contains lucrative 
fisheries (6, 7). However, in recent decades, 
illegal, unreported, and unregulated fish- 
ing has led to region-wide overfishing 
(6, 7), posing a major threat to cetaceans 
(8). Overfishing limits cetaceans’ nutri- 
tion through prey depletion (9) and is 
closely tied to vessel strikes and bycatch 
of hundreds of thousands of cetaceans 
(10, 11). Many fishing vessels also produce 
underwater noises that damage cetaceans’ 
hearing and disturb their behavior, causing 
disorientation (12). As a result, cetaceans
could disappear from the South China Sea 
if no conservation actions are taken. 
Cetaceans are important megafauna 
and valuable marine biological resources; 
saving them is vital to maintaining 
marine biodiversity and ecosystems. 
Communication with the wider science, 
conservation, and policy communities 
about the need to conserve cetaceans 
in the South China Sea is the first step 
toward raising awareness and inciting 
policy changes by regional governments 
to manage and regulate fisheries and 
other anthropogenic activities. Meanwhile, 
regional governments should invest 
in research to investigate and identify 
important cetacean areas in the South 
China Sea, independent of any political 
or socioeconomic concerns. This research 
can inform the design and management 
of marine protected areas, where guide- 
lines or regulations are needed to control 
anthropogenic activities, including fishing. 
Finally, the migratory nature of both fishes 
and cetaceans requires all South China 
Sea claimant nations to realize that they 
have a legal obligation to cooperate on 
fisheries management. National plans of 
action against illegal, unreported, and 
unregulated fishing need to be harmonized 
throughout the region, which will both 
safeguard the food security and livelihoods 
of the communities in and around the 
South China Sea and protect vulnerable 
cetaceans from continued harm.

Tenure and promotion
after the pandemic 


To compensate for declines in productivity 
induced by the pandemic, many universi- 
ties have automatically extended tenure 
clocks by 1 year (1). This move is necessary 
but not sufficient. Tenure clock extensions 
disadvantage some groups. For example, 
in economics, women on longer clocks
due to parental leave get tenure at lower
rates than men (2). Many men use leave to 
produce articles, whereas women are more 
likely to care for children. Tenure commit- 
tees often fail to account for differences in 
how leave time is spent. The coronavirus 
disease 2019 (COVID-19) pandemic will 
produce additional inequalities. Stay-at- 
home and public safety orders preclude 
some research, such as human subjects and 
laboratory work, while creating opportuni- 
ties for others. Now is the time to develop 
strategies to mitigate the inequitable 
effects of the quarantine. 

External reviewers and tenure and 
promotion committees should make three 
adjustments to the evaluation process. 
First, they should instruct candidates to 
pick the 6 best years of their record and 
require that evaluators and committees 
assess only the quality and impact of 
research, teaching, and service from those 
years, not the total years after completing 
their Ph.D. (or after attaining tenure, in the 
case of full professors). Second, evaluators
should require a COVID-19 impact statement
ment that explains the research, teaching, 
and service that candidates were able or 
unable to do, infrastructural or financial 
constraints, and obligations including 
child and elder care. Third, reviewers and 
committees should consider qualitative 
and holistic assessments in addition to 
quantitative evaluations such as number 
of publications, citations, impact factors, 
research expenditures, and teaching scores. 
These indicators could potentially carry 
even more biases after COVID-19 (3-6). 
Let’s treat the pandemic as an opportunity 
to adopt new standards for advancement 
that are fair to caregivers and people with 
diverse research agendas.

TECHNICAL COMMENT ABSTRACTS


Comment on “RNA-guided DNA insertion 
with CRISPR-associated transposases” 


Phoebe A. Rice, Nancy L. Craig, Fred Dyda 
Strecker et al. (Research Article, 5 July 2019, 
p. 48) described a system for exploiting a 
Tn7-type transposon-encoded CRISPR-Cas 
system to make RNA-guided, programmable 
insertions. Although this system has great 
promise, we note that the well-established 
biochemistry of Tn7 suggests that the par- 
ticular system used may insert not only the 
transposon but also the entire donor plasmid. 
Full text: dx.doi.org/10.1126/science.abb2022 


Response to Comment on “RNA-guided DNA 
insertion with CRISPR-associated transposases” 


Jonathan Strecker, Alim Ladha, Kira S. Makarova, 
Eugene V. Koonin, Feng Zhang 

Rice et al. suggest that the CRISPR- 
associated transposase ShCAST system 
could lead to additional insertion products 
beyond simple integration of the donor. We 
clarify the outcomes of ShCAST-mediated 
insertions in Escherichia coli, which consist 
of both simple insertions and integration of 
the donor plasmid. This latter outcome can 
be avoided by use of a 5' nicked DNA donor. 
Full text: dx.doi.org/10.1126/science.abb2920

Strecker et al. (Research Articles, 5 July 2019, p. 48) described a system for exploiting a Tn7-type
transposon-encoded CRISPR-Cas system to make RNA-guided, programmable insertions. Although this 
system has great promise, we note that the well-established biochemistry of Tn7 suggests that the 
particular system used may insert not only the transposon but also the entire donor plasmid. 


The Scytonema hofmanni CRISPR-associated transposase 
(ShCAST) system described by Strecker et al. (1) is based on 
a Tn7-like transposon that carries its own nuclease-defective 
CRISPR-Cas system, which it uses for RNA-guided target 
choice (2). The canonical Escherichia coli transposon Tn7 
uses two enzymes to create double-strand breaks at the do- 
nor-transposon junctions: TnsB to cleave one strand (and 
subsequently attach it to target DNA) and TnsA to cleave 
the other (second) DNA strand, leading to excision of the 
transposon from a donor plasmid followed by its insertion 
into a target plasmid. When TnsA is mutated, however, the 
products are not these simple insertions of the transposon, 
but rather co-integrates in which the entire donor plasmid, 
flanked by copies of the transposon, is inserted into the tar- 
get (3). Similar co-integrate products are also created by 
systems such as Mu transposase, which is a TnsB homolog 
that lacks a TnsA partner (4). 

The ShCAST Tn7-like transposon used by Strecker et al.
is naturally lacking in TnsA. Although some DDE transposa- 
ses harbor insertion domains that trigger other mechanisms 
for cleavage of the second strand, ShCAST TnsB does not. 
Homologies to the other systems noted above therefore sug- 
gest that their products may not be solely simple insertions 
as cartooned, but instead may include fusions of the donor 
plasmid and target. Unfortunately, no experiments were 
reported that would distinguish between the two possibili- 
ties, such as measuring the size of purified “pInsert” product 
plasmids containing the transposon or testing for the pres- 
ence of donor DNA in the final products. Although the 
ShCAST integration system is suggested to bypass the need 
for homologous recombination-mediated DNA repair
normally found in some CRISPR/Cas applications, processing of
co-integrate products to remove the donor DNA would likely
still require these recombination functions unless another 
nuclease processed the second strand at the donor site.

Rice et al. suggest that the CRISPR-associated transposase ShCAST system could lead to additional
insertion products beyond simple integration of the donor. We clarify the outcomes of ShCAST-mediated 
insertions in Escherichia coli, which consist of both simple insertions and integration of the donor plasmid. 
This latter outcome can be avoided by use of a 5’ nicked DNA donor. 


Rice et al. (1) raise an important point about the mechanism 
of integration of the type V CRISPR-associated transposase 
(CAST) systems we recently characterized (2). Although 
CAST loci belong to the Tn7-like class of transposons, addi- 
tional analysis indicates that type V CAST loci are most 
closely related to transposons of the Tn5053 family (3, 4), a 
distinct family of transposons that has not yet been mecha- 
nistically characterized. In particular, the Tn5053 family 
transposons lack TnsA, the enzyme responsible for the 5’ 
donor cleavage in the Tn7 transposon (5, 6), and result in a 
co-integrate product that is resolved by the site-specific re- 
combinase TniR (4). Thus, in CAST systems, the 5’ donor 
ends might not be cleaved, resulting in a co-integrate prod- 
uct containing duplicated cargo DNA and the donor back- 
bone. Alternatively, a CAST protein might either cleave the 
5' donor end or help resolve the co-integrate to yield a sim- 
ple insertion. 

To investigate the exact structure of the insertion prod- 
uct, we performed nanopore sequencing of 15 Scytonema 
hofmanni CAST (ShCAST)-mediated genome insertions in 
E. coli and found nine simple insertions and nine
co-integrates across several target sites (Fig. 1A). Similarly, a
genetic assay using a plasmid target revealed about 20% co- 
integrate insertions (Fig. 1B). A tentative model is that the 
initial insertion product is a co-integrate that can be re- 
solved by the cellular DNA recombination and repair sys- 
tems. However, we cannot rule out that CAST components 
contribute to 5' donor cleavage or co-integrate resolution in 
E. coli or the native host. Notably, all instances of CAST in- 
sertion in cyanobacterial genomes identified to date are 
simple insertions. We will continue to investigate these as a 
part of our ongoing studies.

Use of linear or 5’ nicked DNA donors prevents
co-integrate formation (Fig. 1C), thereby providing an approach
for applying CAST for homologous recombination- 
independent genome engineering. Continued investigation 
of the mechanism of CAST is expected to yield a deeper un- 
derstanding of the biology of this remarkable DNA integra- 
tion system and propel its development as a molecular 
technology.

Fig. 1. Characterization of CAST insertion products. (A) Schematic of genome targeting experiment
and summary of nanopore sequencing results. (B) Genetic assay for plasmid targeting. pInserts were
retransformed and selected on CmR+ and CmR+KanR+ plates to determine the fraction of co-integrate
insertions. The total insertion frequency was determined by droplet digital PCR and used to calculate the
co-integrate rate. (C) In vitro reactions with purified CAST proteins using plasmid donor or PCR-amplified
linear donor.

IN SCIENCE JOURNALS


REPRODUCTIVE BIOLOGY 
Local control of sperm maturation 


Edited by Michael Funk 


Newly produced spermatozoa within the testis do not
have fertilizing ability but become fully functional when 
they mature in the epididymis. The development of the 
epididymis itself is dependent on testicular factors 
arriving via luminal flow. Improper signaling between 
the testis and epididymis is hypothesized to result in male 
infertility. Kiyozumi et al. identified NELL2 as a testicular 
luminal protein that binds to its receptor, ROS1, on the luminal 
epididymis surface and induces epididymal differentiation 
(see the Perspective by Lord and Oatley). In turn, differentiated 
epididymis secretes a fertility-essential protease, ovochy- 
mase-2, to make spermatozoa fully mature and functional. 
Thus, testis-epididymis interorgan communication by this 
“lumicrine” regulation ensures mammalian reproduction. —BAP 


Science, this issue p. 1132; see also p. 1053 


Fluorescence microscopy image revealing localization of the protease ovochymase-2 
(purple) within the caput epididymis, which is part of the male reproductive tract in mammals


WEARABLE DEVICES 
Improving ionic 
thermoelectrics 


Using ions as charge carriers in 
thermoelectric devices usually 
requires using either thermal 
diffusion or redox reactions at 
two electrodes with different 
temperatures. Han et al. lever- 
aged both of these strategies 
to develop a gelatin-based ionic 
thermoelectric device that uses 
alkali salts and an iron-based 
redox couple to generate a large 
thermopower. This device is 
capable of generating useful 
amounts of energy from body 
heat. —BG 

Science, this issue p. 1091 


PLASTIC POLLUTION 


Not just settling 

What controls the distribution 
of microplastics on the deep 
seafloor? Kane et al. show that 
the answer to that question is 
more complicated than particles
simply settling from where they
are found on the sea surface 
(see the Perspective by Mohrig). 
Using data that they collected off 
the coast of Corsica, the authors 
show that thermohaline-driven 
currents can control the distribu- 
tion of microplastics by creating 
hotspots of accumulation, analo- 
gous to their role in causing 
focused areas of seafloor sedi- 
ment deposition. Such currents 
also supply oxygen and nutrients 
to deep-sea benthos, so deep- 
sea biodiversity hotspots are also 
likely to be microplastic hotspots. 
—HJS 

Science, this issue p. 1140; 

see also p. 1055 


CORONAVIRUS 
Smothering fecal-oral 
coronavirus spread 


Diarrhea is a common symptom
in patients with coronavirus 
disease 2019 (COVID-19). Zang 
et al. used organoid cultures of
cells from the epithelial lining of
human small and large intestine 
as an in vitro model system with 
which to study viral entry and 
replication in enterocytes. Mature 
enterocytes expressing the viral 
receptor were susceptible to pro- 
ductive infection, which was also 
stimulated by the expression of a 
protease involved in viral entry. A 
subset of patients with COVID-19 
shed high amounts of viral RNA 
in feces, but experiments with 
simulated human colonic fluid 
suggested that any shed virus 
would be rapidly inactivated 
during transit through the colon. 
—IRW 

Sci. Immunol. 5, eabc3582 (2020). 


TOPOLOGICAL OPTICS 
Topological insulators in 
the spotlight 


In addition to having an insulating 
interior while at the same time 
supporting conducting surface 
states, topological insulators have 
many other interesting proper- 
ties. Higher-order topological 
insulating states, where regions 
of interest are along edges and 
at corners, have been difficult to 
identify unambiguously. Peterson 
et al. developed a theoretical 
framework to help identify and 
characterize these exotic states, 
including a new topological 
marker—the fractional charge 
density—that can be used to 
detect topological states of 
matter when the spectroscopic 
probe of gapless surface states 
is not accessible. The agreement 
between experimental work 
and theory is encouraging for 
applicability to other topological 
platforms. —ISO 

Science, this issue p. 1114 


CANCER 
How cancer cells adapt 
to stress 


Bacteria adapt to harsh 
conditions such as antibiotic 
exposure by acquiring new 
mutations, a process called 
stress-induced mutagenesis. 
Cipponi et al. investigated 
whether similar programs of 
mutagenesis play a role in the 
response of cancer cells to 
targeted therapies. Using in 
vitro models of intense drug 
selection and genome-wide 
functional screens, the authors 
found evidence for an analogous 
process in cancer and showed 
that it is regulated by the mam- 
malian target of rapamycin 
(mTOR) signaling pathway. This 
pathway appears to mediate a 
stress-related switch to error- 
prone DNA repair, resulting in 
the generation of mutations 
that facilitate the emergence of 
drug resistance. —PAK 

Science, this issue p. 1127 


ARCHAEOLOGY 
Timing the rise of maize 
in Mesoamerica 


Many lines of evidence suggest 
that maize (Zea mays) became 
a dietary staple across ancient 
Mesoamerica. However, there 
has been little direct evidence 
of its consumption, and the 
timing of how it came to domi- 
nate the diet of the peoples of 
the region is unknown. Using 
stable isotopic evidence from 
human skeletons excavated 
from two rock shelter sites in 
Belize, Kennett et al. show that 
there is no clear evidence of 
maize consumption by the sites’ 
inhabitants before 4700 years 
ago. However, isotopes from 
more recent individuals show the 
increasing importance of maize 
in the diet, such that by 4000 
years ago, maize had become a 
persistent dietary staple. —MSA 
Sci. Adv. 10.1126/sciadv.aba3245 
(2020). 


ANALYTICAL CHEMISTRY 
Perfluorocarbons’ 
path into soils 


Covering carbon chains with 
fluorines has produced a variety 
of useful nonstick coatings. 
However, growing concern 
about the toxicity and extraordi- 
nary environmental persistence 
of the underlying compounds is 
spurring a search for alterna- 
tives. The precise structure 
of these next-generation 
alternatives often remains a 
trade secret. Washington et al. 
sampled soils in New Jersey and 
then used mass spectrometry 
to assign plausible structures— 
incorporating chlorine and ether 
segments into the CF₂ chain—to 
compounds that appear to have 
emanated from their manu- 
facture (see the Policy Forum 
by Gold and Wagner). The data 
can inform in-depth studies of 
these compounds’ environmen- 
tal transport and persistence. 
—JSY 

Science, this issue p. 1103; 

see also p. 1066 


IN OTHER JOURNALS 


MATERIALS SCIENCE 


Edited by Caroline Ash 
and Jesse Smith 


Better food preservation 


Fresh fruits and vegetables are a key part of a healthy diet 
but there can be considerable waste caused by spoilage 
after harvest. Shelf life can be extended by preventing 
microbial growth, dehydration, or storage at higher tem- 
peratures, but adding a wax coating, creating a reduced 
oxygen environment, or using refrigeration can be expensive 
and/or time consuming and may alter the taste of the food. 
Jung et al. developed an egg-sourced albumin coating rein- 
forced with nanocrystalline cellulose that can be made from 
waste materials. When coated onto banana, avocado, papaya, 
and strawberry, shelf life was extended by a week, with reduced 
external browning and internal ripening. The coatings are safe 
to ingest but are also easily removed through washing. —MSL 
Adv. Mater. 10.1002/adma.201908291 (2020). 


Postharvest spoilage of fresh fruits and vegetables is reduced by a safe 
and easily removed coating made with egg albumin. 


DEVELOPMENT 
Making cartilage 
throughout life 


The skeletons of newborn 
mammals are soft and pliable 
because they are composed 
primarily of cartilage. During 
growth to adulthood, most 
cartilage is replaced by bone. 
The remaining cartilage, such 
as that found in the joints, 
does not readily regenerate, so 
joints deteriorate with age. By 
contrast, elasmobranch fish 
make cartilage throughout their 
lives. Marconi et al. studied the 
cartilaginous fish Leucoraja 
erinacea, or little skate, from 
embryo to adult and observed 
that progenitor cells that sur- 
round the cartilage skeleton are 
also present in the adult. These 
cells are induced to transform 
into chondrocytes after injury. 
Understanding cartilage repair 
in skates may offer inspiration 
for research into human joint 
repair. —BAP 

eLife 9, e53414 (2020). 


Maize, a global staple crop seen here growing in a field in El Salvador, rose to 
dominate diets in Mesoamerica by 4000 years before present. 


PARASITOLOGY 
How helminths trump 
diabetes 


The prevalence of type 2 diabe- 
tes mellitus (T2DM) is inversely 
correlated with helminth 
infections in Asia. This may be 
because helminths have an 
immunomodulatory effect and 
thus dampen the type 1 (allergic 
type) immune responses 
underlying the proinflammatory 
state of T2DM. To test this idea, 
Rajamanickam et al. measured 
plasma levels of cytokines in 60 
individuals living in rural India 
infected with Strongyloides 
stercoralis, a persistent nema- 
tode gut parasite, which in most 
people is symptomless. They 
compared these results with 
plasma cytokine concentrations 
from 58 T2DM patients showing 
no worm infection. Parasitized 
individuals showed significantly 
increased levels of interleukin-1 
receptor alpha (IL-1Rα), which 
is typical of type 2 immune 
responses, and low levels of a 
wide range of proinflammatory 
cytokines and chemokines. 
When diabetic subjects were 
treated for parasites, their 
proinflammatory state, typically 
marked by increased IL-18 and 
IL-6, partially rebounded. —CA 
PLOS Negl. Trop. Dis. 14, e0008101 
(2020). 


IMMUNOLOGY 
Neutrophils avoid 
a traffic jam 


Neutrophils are the most 
abundant immune cell in the 
circulation and are typically 
the first responders to sites 
of infection or injury. How 
large numbers of neutrophils 
can efficiently travel through 
capillary networks is a mys- 
tery. Wang et al. investigated 
neutrophil trafficking in mouse 
liver using intravital micros- 
copy and found that groups of 
neutrophils diverged at capillary 
bifurcations by traveling in 
an alternating pattern. This 
phenomenon was then studied 
in a controlled fashion using 
microfluidic chips connected 
to a chemoattractant chamber. 
Neutrophils were able to bias 
the decisions made by their 
companions at bifurcations by 
altering both hydraulic resis- 
tance and chemoattractant 
gradients. It is likely that similar 
mechanisms are widely used 
to coordinate complex immune 
responses. —STS 

Nat. Commun. 11, 2385 (2020). 


EXOPLANET ATMOSPHERES 


Different as night and day 
The atmospheric composi- 
tions of exoplanets are usually 
determined by the technique of 
transit spectroscopy: When the 
planet transits between its host 
star and Earth, starlight passes 
through its atmosphere close to 
local dawn and dusk, imprinting 
additional absorption lines on 
the stellar spectrum. Pluriel et 
al. considered how the results 
are affected if the star heats the 
planet enough for atmospheric 
molecules to be dissociated on 
the planet's day side and recom- 
bine on the night side. They 
found that transit spectroscopy 
then produces highly biased 
atmospheric compositions differing 
from the true values by 
up to three orders of magnitude. 
More sophisticated methods 
are needed to determine the 
atmospheric compositions of 
hot exoplanets. —KTS 

Astron. Astrophys. 636, A66 (2020). 


Published by AAAS 


FOOD SECURITY 
Scarcity in times of abundance 


A cornerstone of food security is stability of 
access to the food supply. In a pioneering 
study of fishing communities in Amazonian 
seasonally flooded forest, Tregidgo et al. show 
how seasonal fluctuations in access can lead 
to scarcity even when the resource is abundant. 
Fishing becomes more challenging in the high- 
water season, leading to severe food insecurity for 
the more deprived households in the community. 
Communities such as these that depend on a 
wildlife resource for a substantial proportion of 
their diet may become further disadvantaged as 
seasonal fluctuations in access to food become 
further exacerbated by climate change. —AMS 
People Nat. 10.1002/pan3.10086 (2020). 


For communities living along the Amazon, greater seasonal 
fluctuations in water levels brought by climate change 
are likely to make access to fish resources more challenging. 


SCIENTIFIC WORKFORCE 
To a postdoc and beyond 


To be a postdoctoral fellow today 
is to exist between a short-term 
research position and the reality 
of the academic job market. 
Nowell et al. sought to identify 
professional development (PD) 
opportunities used by postdocs 
to navigate a changing career 
landscape. Using quantitative 
and qualitative data collected 
from more than 500 current 
postdocs, the authors show that 
although postdocs engaged in 
diverse PD opportunities, these 
strategies were not always useful 
or career enhancing. By describ- 
ing postdoc perceptions of PD, 
in parallel with their perceived 
usefulness, this study provides 
a comprehensive understanding 
of the misalignment of postdoc 
needs with available opportuni- 
ties and can serve as a manual 
for helping academic institutions 
direct valuable resources toward 
the most effective postdoc PD 
strategies. —MMc 

Palgrave Commun. 6, 95 (2020). 


ALSO IN SCIENCE JOURNALS 


CORONAVIRUS 
Applications of antibody 
testing 


There has been a strong focus 
on testing to determine whether 
an individual has an active severe 
acute respiratory syndrome– 
coronavirus 2 (SARS-CoV-2) 
infection, but how do we find out 
if someone has been infected 
in the past? In a Perspective, 
Krammer and Simon discuss the 
key applications of serological 
tests to detect antibodies against 
SARS-CoV-2. Serological testing 
could answer key questions 
about whether neutralizing 
antibodies form, and if they 
do, serological studies could 
assess the duration of immune 
protection. Such tests could 
also identify recovered patients 
with high amounts of antibody 
who could donate blood serum 
for therapeutic use. In addition, 
population serosurveys could 
inform mitigation practices, 
which could be especially impor- 
tant during subsequent predicted 
waves of infection. However, it is 
important to ensure that tests are 
accurate and sufficiently reliable 
first. —GKA 

Science, this issue p. 1060 


COMPUTER SCIENCE 


From bottom to top 
The doubling of the number of 
transistors on a chip every 2 
years, a seemingly inevitable trend 
that has been called Moore's law, 
has contributed immensely to 
improvements in computer per- 
formance. However, silicon-based 
transistors cannot get much 
smaller than they are today, and 
other approaches should be 
explored to keep performance 
growing. Leiserson et al. review 
recent examples and argue that 
the most promising place to look 
is at the top of the computing 
stack, where improvements in 
software, algorithms, and hard- 
ware architecture can bring the 
much-needed boost. —JS 
Science, this issue p. 1079 


PLASMID EVOLUTION 
Agrobacteria virulence 
writ large 


Plasmids are widespread among 
bacteria and are important 
because they spread virulence 
and antibiotic resistance traits, 
among others. They are horizon- 
tally transferred between strains 
and species, so it is difficult to 
work out their evolution and 
epidemiology. Agrobacteria, a 
diverse grouping of species that 
infect plants, inject oncogenic 
Ti and Ri plasmids, which 
cause crown galls and hairy 
root diseases, respectively. The 
upside is that these plasmids 
have become valuable biotech- 
nological tools. Weisberg et al. 
combed through an 80-year-old 
collection of Agrobacterium 
strains but found a surprisingly 
low diversity of plasmids. It is 
puzzling how limited the number 
of plasmid lineages is despite 
reported high levels of plasmid 
recombination, but what is clear 
is how plant production systems 
have influenced plasmid spread 
into various genomic backbones. 
—CA 

Science, this issue p. 1080 


STRUCTURAL BIOLOGY 
Architecture of DNA- 
organizing complex 

The highly conserved mam- 
malian CTC1-STN1-TEN1 
(CST) complex is critical for 
genome stability and telo- 
mere maintenance. Lim et al. 
solved the structure of the 
human CST complex using 
cryo–electron microscopy. CST 
forms an unprecedented and 
substantial decameric super- 
complex triggered by telomeric 
single-stranded binding. This 
decameric form, with single- 
stranded DNA-binding capacity 
of up to 10 telomeric repeats, 
suggested the possibility of CST 
organizing telomere overhangs 
into compact and restrictive 
structures in a manner similar to 
the nucleosome's organization 
of double-stranded DNA. This 
work provides a platform for 
understanding the mechanisms 
of various CST functions. —SYM 
Science, this issue p. 1081 


CHEMICAL PHYSICS 
Watching electrons 
swarm ammonia 


Liquid ammonia is unusual in 
its capacity to host electrons in 
stable solution, with vivid blue 
and bronze colors signifying the 
low- and high-concentration 
regimes, respectively. Buttersack 
et al. used photoelectron spec- 
troscopy and accompanying 
theoretical simulations to track 
the precise energetic changes 
that ensued as steadily rising 
quantities of electrons were 
introduced by dissolved lithium, 
sodium, or potassium (see the 
Perspective by Isborn). The 
results point to a gradual transi- 
tion from the dilute electrolyte 
solution of paired dielectrons to 
the more delocalized metallic 
structure at the highest concen- 
trations. —JSY 

Science, this issue p. 1086; 

see also p. 1056 


ORGANIC CHEMISTRY 
Hydrogenations that 
tolerate N-O bonds 


Catalysts that add hydrogen to 
carbon-carbon, carbon-nitrogen, 
and carbon-oxygen double bonds 
are among the most widely used 
in synthetic chemistry. They are 
particularly adept at delivering 
just one of two mirror-image 
products. However, they may 
also target adjacent bonds in 
the compound that would be 
better left intact. Mas-Roselló et 
al. report that an iridium catalyst 
paired with a strong acid can 
hydrogenate C=N bonds without 
disturbing a weak N—O bond 
on the same nitrogen center. 
The reactions proceed at room 
temperature with high enantiose- 
lectivity. —JSY 

Science, this issue p. 1098 
NEUROSCIENCE 
Making blind retinas 
see again 


Photoreceptor degenera- 
tion is an important cause of 
blindness. Nelidova et al. used 
tunable, near-infrared sensors 
to render diseased photorecep- 
tors light sensitive again (see 
the Perspective by Franke and 
Vlasits). Gold nanorods capable 
of detecting infrared light were 
coupled with an antibody to tem- 
perature-sensitive ion channels. 
When the nanorods absorbed 
light and converted it into heat, 
the coupled ion channels were 
gated by infrared light. In a 
mouse model of retinal degen- 
eration, these ion channels were 
successfully targeted to cone 
photoreceptors, and responses 
to near-infrared light could be 
detected. In the primary visual 
cortex, more cells responded 
to near-infrared stimuli in mice 
expressing these ion channels 
than in controls. By changing the 
length of the gold nanorods, the 
system could be tuned to differ- 
ent infrared wavelengths. —PRS 
Science, this issue p. 1108; 
see also p. 1057 


PALEOECOLOGY 
Mangroves under 
sea level rise 


The rate of sea level rise has 
doubled from 1.8 millimeters 
per year over the 20th century 
to ~3.4 millimeters per year 
in recent years. Saintilan et al. 
investigated the likely effects 
of this increasing rate of rise on 
coastal mangrove forest, a tropi- 
cal ecosystem of key importance 
for coastal protection (see the 
Perspective by Lovelock). They 
reviewed data on mangrove 
accretion 10,000 to 7000 years 
before present, when the rate 
of sea level rise was even higher 
than today as a result of glacial 
ice melt. Their analysis sug- 
gests an upper threshold of 
7 millimeters per year as the 
maximum rate of sea level rise 
associated with mangrove verti- 
cal development, beyond which 
the ecosystem fails to keep up 
with the change. Under pro- 
jected rates of sea level rise, they 
predict that a deficit between 
accretion and sea level rise is 
likely to commence in the next 
30 years. —AMS 

Science, this issue p. 1118; 

see also p. 1050 


IMMUNOLOGY 
Innate immune cells 
remember 


Immunological memory is a phe- 
nomenon by which immune cells 
can quickly recognize an antigen 
that the host has previously 
encountered. Certain cells of the 
innate immune system exhibit 
memory-like responses known 
as trained immunity. Rapid, 
antigen-specific secondary 
(anamnestic) responses were 
long thought to be the domain of 
B and T cells. However, Dai et al. 
report that monocytes and mac- 
rophages can acquire memory 
specific for particular major 
histocompatibility complex I 
antigens using paired A-type 
immunoglobulin-like receptors 
(PIR-As) (see the Perspective by 
Dominguez-Andrés and Netea). 
This pathway contributes to 
recognition and rejection of 
allograft-transplanted tis- 
sue from a donor of the same 
species. Genetic depletion or 
blockade of PIR-As in mice 
diminished the rejection of 
kidney and heart allografts. 
This work, which expands 
immunological memory to 
include myeloid cells, points to 
targets that may improve organ 
transplantation outcomes in the 
future. —STS 

Science, this issue p. 1122; 

see also p. 1052 


MOLECULAR BIOLOGY 
DNA barcodes in small 
packages 


Under adverse environmental 
conditions, some microorgan- 
isms form spores that provide 
robust protection for genetic 
material. Qian et al. developed a 
system in which DNA barcodes 
are encapsulated inside non- 
germinating microbial spores 
and can be dispersed on objects 
or in the environment (see the 
Perspective by Nivala). These 
barcoded spores provide a 
durable, specific marker that can 
be read out quickly with simple 
equipment. When applied to soil, 
the spores can be transferred 
to and from objects around 
them, enabling tracking at 
meter-scale resolution. On plant 
leaves, the spores are not readily 
transferred, and the authors 
demonstrate a potential use for 
tracking agricultural products. 
—MAF 

Science, this issue p. 1135; 

see also p. 1058 


GENE EDITING 
Enforced editing 


Various autoimmune diseases 
could potentially be treated 
with regulatory T cells (Tregs), 
but there are many hurdles 
between this idea and clini- 
cal execution. Honaker et al. 
devised a gene-editing strategy 
to enforce the expression of 
FOXP3, the master Treg tran- 
scription factor, in CD4+ T cells 
isolated from human peripheral 
blood, thereby overcoming the 
limitations of Treg isolation and 
expansion. The resulting stable 
FOXP3 expression enabled a 
suppressive phenotype in vitro, 
and the edited cells were also 
functional in both a xenogeneic 
graft-versus-host disease model 
and an experimental autoim- 
mune encephalitis model. This 
approach has the potential to 
rapidly translate to clinical use. 
—LP 

Sci. Transl. Med. 12, eaay6422 (2020). 
PHARMACOLOGY 
More targeted endothelial 
protection 


The bioactive lipid S1P exerts 
effects in diverse tissues. 
Unbiased agonists of the S1P 
receptor S1P1 trigger a reduced 
lymphocyte count, or lympho- 
penia, a useful feature when 
treating autoimmune diseases 
but an undesirable side effect 
when activating S1P1 in other cell 
types. Poirier et al. identified a 
G protein–biased S1P1 agonist, 
SAR247799, which maintained 
endothelial function and reduced 
tissue damage caused by 
ischemia-reperfusion injury in 
two different animal models. The 
characteristics of SAR247799 
show that it is possible to 
target S1P1 in the endothelium 
without compromising immune 
responses. —WW 

Sci. Signal. 12, eaax8050 (2020). 
REVIEW SUMMARY 


COMPUTER SCIENCE 


There’s plenty of room at the Top: What will drive 
computer performance after Moore’s law? 


Charles E. Leiserson, Neil C. Thompson*, Joel S. Emer, Bradley C. Kuszmaul, Butler W. Lampson, 
Daniel Sanchez, Tao B. Schardl 


BACKGROUND: Improvements in computing 
power can claim a large share of the credit for 
many of the things that we take for granted 
in our modern lives: cellphones that are more 
powerful than room-sized computers from 
25 years ago, internet access for nearly half 
the world, and drug discoveries enabled by 
powerful supercomputers. Society has come 
to rely on computers whose performance in- 
creases exponentially over time. 

Much of the improvement in computer per- 
formance comes from decades of miniatur- 
ization of computer components, a trend that 
was foreseen by the Nobel Prize-winning phys- 
icist Richard Feynman in his 1959 address, 
“There’s Plenty of Room at the Bottom,” to 
the American Physical Society. In 1975, Intel 
founder Gordon Moore predicted the regu- 
larity of this miniaturization trend, now called 
Moore’s law, which, until recently, doubled the 
number of transistors on computer chips every 
2 years. 
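
To give a rough sense of scale for the doubling described above, the short sketch below computes the growth factor implied by an idealized Moore's-law curve. It assumes a perfectly regular doubling every 2 years, which real chips only approximated, and the function name is purely illustrative. Under that assumption, 50 years corresponds to 25 doublings, or a factor of about 33.5 million.

```python
# Illustrative arithmetic only: an idealized Moore's-law curve, assuming a
# perfectly regular doubling every 2 years (an approximation of real history).

def transistor_growth_factor(years: float, doubling_period_years: float = 2.0) -> float:
    """Return the multiplicative growth in transistor count after `years`."""
    return 2.0 ** (years / doubling_period_years)


if __name__ == "__main__":
    for years in (2, 10, 20, 50):
        factor = transistor_growth_factor(years)
        print(f"{years:>2} years -> growth factor {factor:,.0f}")
```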

Unfortunately, semiconductor miniaturiza- 
tion is running out of steam as a viable way 
to grow computer performance—there isn’t 
much more room at the “Bottom.” If growth 


in computing power stalls, practically all in- 
dustries will face challenges to their produc- 
tivity. Nevertheless, opportunities for growth 
in computing performance will still be avail- 
able, especially at the “Top” of the computing- 
technology stack: software, algorithms, and 
hardware architecture. 


ADVANCES: Software can be made more effi- 
cient by performance engineering: restructur- 
ing software to make it run faster. Performance 
engineering can remove inefficiencies in pro- 
grams, known as software bloat, arising from 
traditional software-development strategies 
that aim to minimize an application’s devel- 
opment time rather than the time it takes to 
run. Performance engineering can also tailor 
software to the hardware on which it runs, 
for example, to take advantage of parallel pro- 
cessors and vector units. 
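
As a minimal sketch of what restructuring software for speed can look like, the example below shows two ways of multiplying small matrices that produce the same result. The choice of matrix multiplication and both function names are assumptions made for illustration; they are not taken from the summary. The second version transposes one operand once so that the inner loop walks two rows in step, a restructuring that often helps in interpreted code, although any actual speedup would need to be measured on the machine at hand.

```python
# Hypothetical example: the function names and the choice of matrix
# multiplication are illustrative assumptions, not the authors' benchmark.

def matmul_naive(a, b):
    """Textbook triple loop; indexes b column-wise on every inner step."""
    n, m, p = len(a), len(b), len(b[0])
    c = [[0.0] * p for _ in range(n)]
    for i in range(n):
        for j in range(p):
            for k in range(m):
                c[i][j] += a[i][k] * b[k][j]
    return c


def matmul_restructured(a, b):
    """Same result; transposes b once so the inner loop scans two rows."""
    b_cols = list(zip(*b))  # columns of b, each as a tuple
    return [[sum(x * y for x, y in zip(row, col)) for col in b_cols]
            for row in a]


if __name__ == "__main__":
    a = [[1, 2], [3, 4]]
    b = [[5, 6], [7, 8]]
    print(matmul_naive(a, b))
    print(matmul_restructured(a, b))
```

Both functions perform the same arithmetic in the same order, so the restructuring changes only how the work is laid out, not what is computed.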

Algorithms offer more-efficient ways to solve 
problems. Indeed, since the late 1970s, the time 
to solve the maximum-flow problem improved 
nearly as much from algorithmic advances 
as from hardware speedups. But progress on 
a given algorithmic problem occurs unevenly 
and sporadically and must ultimately face di- 
minishing returns. As such, we see the big- 
gest benefits coming from algorithms for new 
problem domains (e.g., machine learning) and 
from developing new theoretical machine 
models that better reflect emerging hardware. 


The Top 

Technology     Software                           Algorithms            Hardware architecture 
Opportunity    Software performance engineering   New algorithms        Hardware streamlining 
Examples       Removing software bloat            New problem domains   Processor simplification 
               Tailoring software to              New machine models    Domain specialization 
               hardware features 

The Bottom: for example, semiconductor technology 

Performance gains after Moore’s law ends. In the post-Moore era, improvements in computing power will 
increasingly come from technologies at the “Top” of the computing stack, not from those at the “Bottom,” 
reversing the historical trend. 

Leiserson et al., Science 368, 1079 (2020) 

Read the full article at https://dx.doi.org/10.1126/science.aam9744 

Hardware architectures can be streamlined—for 
instance, through processor simplification, where 
a complex processing core is replaced with a simpler 
core that requires fewer 
transistors. The freed-up transistor budget can 
then be redeployed in other ways—for example, 
by increasing the number of processor cores 
running in parallel, which can lead to large 
efficiency gains for problems that can exploit 
parallelism. Another form of streamlining is 
domain specialization, where hardware is cus- 
tomized for a particular application domain. 
This type of specialization jettisons processor 
functionality that is not needed for the domain. 
It can also allow more customization to the 
specific characteristics of the domain, for in- 
stance, by decreasing floating-point precision 
for machine-learning applications. 
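
The trade-off behind reduced floating-point precision can be illustrated with a small sketch. It round-trips a value through the 64-bit, 32-bit, and 16-bit IEEE 754 formats using Python's standard `struct` module and reports the rounding error at each width. The helper name and the test value 0.1 are assumptions chosen for illustration; the summary does not specify which formats specialized hardware uses.

```python
import struct

# Illustrative sketch: round-trip a number through narrower IEEE 754 formats
# to see how much precision is given up. The helper name is hypothetical.

def round_to_format(x: float, fmt: str) -> float:
    """Store x in the given struct format and read it back.

    "<d" is 64-bit, "<f" is 32-bit, and "<e" is 16-bit floating point.
    """
    return struct.unpack(fmt, struct.pack(fmt, x))[0]


if __name__ == "__main__":
    value = 0.1
    for name, fmt in (("float64", "<d"), ("float32", "<f"), ("float16", "<e")):
        stored = round_to_format(value, fmt)
        print(f"{name}: stored={stored!r} abs_error={abs(stored - value):.3e}")
```

Narrower formats need fewer bits per value and so less storage and circuitry, at the cost of the larger rounding error the script prints; whether that error is acceptable depends on the application.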

In the post-Moore era, performance im- 
provements from software, algorithms, and 
hardware architecture will increasingly re- 
quire concurrent changes across other levels 
of the stack. These changes will be easier to im- 
plement, from engineering-management and 
economic points of view, if they occur within 
big system components: reusable software with 
typically more than a million lines of code or 
hardware of comparable complexity. When a 
single organization or company controls a big 
component, modularity can be more easily re- 
engineered to obtain performance gains. More- 
over, costs and benefits can be pooled so that 
important but costly changes in one part of 
the big component can be justified by benefits 
elsewhere in the same component. 


OUTLOOK: As miniaturization wanes, the silicon- 
fabrication improvements at the Bottom will 
no longer provide the predictable, broad-based 
gains in computer performance that society has 
enjoyed for more than 50 years. Software per- 
formance engineering, development of algo- 
rithms, and hardware streamlining at the 
Top can continue to make computer applica- 
tions faster in the post-Moore era. Unlike the 
historical gains at the Bottom, however, gains 
at the Top will be opportunistic, uneven, and 
sporadic. Moreover, they will be subject to 
diminishing returns as specific computations 
become better explored. 


The list of author affiliations is available in the full article online. 
*Corresponding author. Email: neil_t@mit.edu 

Cite this article as C. E. Leiserson et al., Science 368, 
eaam9744 (2020). DOI: 10.1126/science.aam9744 
There’s plenty of room at the Top: What will drive 
computer performance after Moore’s law? 


Charles E. Leiserson, Neil C. Thompson, Joel S. Emer, Bradley C. Kuszmaul, 
Butler W. Lampson, Daniel Sanchez, Tao B. Schardl 


The miniaturization of semiconductor transistors has driven the growth in computer performance for 
more than 50 years. As miniaturization approaches its limits, bringing an end to Moore’s law, 
performance gains will need to come from software, algorithms, and hardware. We refer to these 
technologies as the “Top” of the computing stack to distinguish them from the traditional technologies 
at the “Bottom”: semiconductor physics and silicon-fabrication technology. In the post-Moore era, the 
Top will provide substantial performance gains, but these gains will be opportunistic, uneven, and 
sporadic, and they will suffer from the law of diminishing returns. Big system components offer a 
promising context for tackling the challenges of working at the Top. 


Over the past 50 years, the miniaturiza- 
tion of semiconductor devices has been 
at the heart of improvements in com- 
puter performance, as was foreseen by 
physicist Richard Feynman in his 1959 
address (1) to the American Physical Society, 
“There’s Plenty of Room at the Bottom.” Intel 
founder Gordon Moore (2) observed a steady 
rate of miniaturization and predicted (3) that 
the number of transistors per computer chip 
would double every 2 years—a cadence, called 
Moore’s law, that has held up considerably 
well until recently. Moreover, until about 2004, 
new transistors were not only smaller, they 
were also faster and more energy efficient (4), 
providing computers with ever more speed 
and storage capacity. Moore’s law has driven 
economic progress ubiquitously. 
Unfortunately, Feynman’s “room at the bot- 
tom” is no longer plentiful. The International 
Technology Roadmap for Semiconductors [(5), 
p. 36] foresees an end to miniaturization, and 
Intel [(6), p. 14], a leader in microprocessor 
technology, has acknowledged an end to the 
Moore cadence. Indeed, Intel produced its 
14-nm technology in 2014, but it stalled on 
producing its 10-nm technology, due in 2016, 
until 2019 (7). Although other manufacturers 
continued to miniaturize—for example, with 
the Samsung Exynos 9825 (8) and the Apple 
A13 Bionic (9)—they also failed to meet the 
Moore cadence. There isn’t much more room 
at the bottom. 
Why is miniaturization stalling? It’s stalling 
because of fundamental physical limits—the 
physics of materials changes at atomic levels— 
and because of the economics of chip manu- 
facturing. Although semiconductor technology 
may be able to produce transistors as small 
as 2 nm (20 Å), as a practical matter, min- 
iaturization may end around 5 nm because of 
diminishing returns (10). And even if semi- 
conductor technologists can push things a little 
further, the cost of doing so rises precipitously 
as we approach atomic scales (11, 12). 

In this review, we discuss alternative ave- 
nues for growth in computer performance after 
Moore’s law ends. We believe that opportuni- 
ties can be found in the higher levels of the 
computing-technology stack, which we refer 
to as the “Top.” Correspondingly, by “Bottom” 
we mean the semiconductor technology that 
improved so dramatically during the Moore 
era. The layers of the computing stack harness 
the transistors and other semiconductor de- 
vices at the Bottom into useful computation at 
the Top to solve real-world problems. We di- 
vide the Top into three layers: (i) hardware 
architecture—programmable digital circuits 
that perform calculations; (ii) software—code 
that instructs the digital circuits what to com- 
pute; and (iii) algorithms—efficient problem- 
solving routines that organize a computation. 
We contend that even if device technologies at 
the Bottom cease to deliver performance gains, 
the Top will continue to offer opportunities. 

Unlike Moore’s law, which has driven up 
performance predictably by “lifting all boats,” 
working at the Top to obtain performance will 
yield opportunistic, uneven, and sporadic gains, 
typically improving just one aspect of a par- 
ticular computation at a time. For any given 
problem, the gains will suffer from the law of 
diminishing returns. In the long run, gains 
will depend on applying computing to new 
problems, as has been happening since the 
dawn of digital computers. 


5 June 2020 


Working at the Top to obtain performance 
also differs from the Bottom in how it affects 
a computing system overall. The performance 
provided by miniaturization has not required 
substantial changes at the upper levels of the 
computing stack, because the logical behavior 
of the digital hardware, software, and data in 
a computation is almost entirely independent 
of the size of the transistors at the Bottom. As 
a result, the upper levels can take advantage 
of smaller and faster transistors with little or 
no change. By contrast—and unfortunately— 
many parts of the Top are dependent on each 
other, and thus when one part is restructured to 
improve performance, other parts must often 
adapt to exploit, or even tolerate, the changes. 
When these changes percolate through a sys- 
tem, it can take considerable human effort to 
correctly implement and test them, which in- 
creases both costs and risks. Historically, the 
strategies at the Top for improving perform- 
ance coexisted with Moore’s law and were 
used to accelerate particular applications that 
needed more than the automatic performance 
gains that Moore’s law could provide. 

Here, we argue that there is plenty of room 
at the Top, and we outline promising opportu- 
nities within each of the three domains of soft- 
ware, algorithms, and hardware. We explore 
the scale of improvements available in these 
areas through examples and data analyses. We 
also discuss why “big system components” will 
provide a fertile ground for capturing these 
gains at the Top. 


Software 


Software development in the Moore era has 
generally focused on minimizing the time it 
takes to develop an application, rather than 
the time it takes to run that application once it 
is deployed. This strategy has led to enormous 
inefficiencies in programs, often called soft- 
ware bloat. In addition, much existing software 
fails to take advantage of architectural fea- 
tures of chips, such as parallel processors and 
vector units. In the post-Moore era, software 
performance engineering—restructuring soft- 
ware to make it run faster—can help applica- 
tions run more quickly by removing bloat and 
by tailoring software to specific features of the 
hardware architecture. 

To illustrate the potential gains from per- 
formance engineering, consider the simple 
problem of multiplying two 4096-by-4096 
matrices. Let us start with an implementation 
coded in Python, a popular high-level program- 
ming language. Here is the four-line kernel of 
the Python 2 code for matrix-multiplication: 

    for i in xrange(4096):
        for j in xrange(4096):
            for k in xrange(4096):
                C[i][j] += A[i][k] * B[k][j]

The code uses three nested loops and fol- 
lows the method taught in basic linear-algebra 
classes. It turns out, however, that this naive 
code leaves much of the performance available 
on modern computers “on the table.” The code 
takes about 7 hours on a modern computer to 
compute the matrix product, as shown by 
the first row (version 1) in Table 1, achieving 
only 0.0006% of the peak performance of the 
machine. (Incidentally, Python 3 requires about 
9 hours for the same computation.) 

Table 1. Speedups from performance engineering a program that multiplies two 4096-by-4096 matrices. Each version represents a successive 
refinement of the original Python code. “Running time” is the running time of the version. “GFLOPS” is the billions of 64-bit floating-point operations per 
second that the version executes. “Absolute speedup” is time relative to Python, and “relative speedup,” which we show with an additional digit of precision, 
is time relative to the preceding line. “Fraction of peak” is GFLOPS relative to the computer's peak 835 GFLOPS. See Methods for more details. 

Version  Implementation               Running time (s)  GFLOPS  Absolute speedup  Relative speedup  Fraction of peak (%)
1        Python                       25,552.48         0.005   1                 -                 0.00
5        Parallel divide and conquer  3.80              36.180  6,727             18.4              4.33
7        plus AVX intrinsics          0.41

How can this naive matrix-multiplication 
code be performance engineered? Simply choos- 
ing a more efficient programming language 
speeds up this calculation dramatically. For 
example, coding it in Java (version 2) produces 
a speedup of 10.8x, and coding it in C (version 3) 
produces an additional speedup of 4.4x, yield- 
ing an execution time that is 47 times faster 
than the original Python. This performance 
improvement comes from reducing the number 
of operations the program performs. In partic- 
ular, Java and C avoid the extraneous work 
that Python does under the hood to make 
programming easier. The price for this per- 
formance gain is programmer productivity: 
Coding in C is more onerous than coding in 
Python, and Java lies somewhere in between. 

Although switching languages gains a speed- 
up of almost 50x, tailoring the matrix code to 
exploit specific features of the hardware makes 
it run an additional 1300 times faster. This gain 
comes from parallelizing the code to run on all 
18 of the processing cores (version 4), exploiting 
the processor’s memory hierarchy (version 5), 
vectorizing the code (version 6), and using 
Intel’s special Advanced Vector Extensions 
(AVX) instructions (version 7). The final op- 
timized code performs the task in only 0.41 s— 
more than 60,000 times faster than the 7 hours 
of the original Python code! 

The point of this example is to illustrate the 
potential gains available from performance 
engineering naively coded software. In the par- 
ticular case of matrix multiplication, a good 
programmer could avoid this programming 
effort by using optimized code from existing 
software libraries. If she were writing code to 
solve a new problem, however, she would need 
to optimize the code herself. And although not 
every application can improve by nearly five 
orders of magnitude through performance 
engineering, most modern software systems 
contain ample opportunity for performance 
enhancement, especially if the codebase is 
large enough. 

During the post-Moore era, it will become 
ever more important to make code run fast 
and, in particular, to tailor it to the hardware 
on which it runs. Modern computers provide 
architectural features designed to make code 
run fast. For example, versions 4 and 6 exploit 
parallelism, which is the ability of computers 
to perform multiple operations at the same 
time. Version 5 exploits locality, which is the 
computer’s ability to access data elements ef- 
ficiently when they are collocated in memory 
(spatial locality) or have been accessed re- 
cently (temporal locality). Version 7 exploits 
both parallelism and locality through care- 
fully coordinated use of Intel’s AVX instructions. 
As we shall see in the Hardware architecture 
section, architectures are likely to become in- 
creasingly heterogeneous, incorporating both 
general-purpose and special-purpose circuitry. 
To improve performance, programs will need 
to expose more parallelism and locality for the 
hardware to exploit. In addition, software per- 
formance engineers will need to collaborate 
with hardware architects so that new pro- 
cessors present simple and compelling ab- 
stractions that make it as easy as possible to 
exploit the hardware. 
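
As a minimal sketch of what exposing locality can look like, the Python 3 code below computes the same small matrix product twice: once with the plain triple loop and once with the loops restructured into tiles, so that each block of A, B, and C is reused while it is likely still in cache. The function names and the tile size are assumptions made for illustration, and this is not the code measured in Table 1. In pure Python the interpreter overhead hides most of the cache benefit, so the sketch shows only the loop structure.

```python
def matmul_naive(A, B):
    """Triple-loop product of square matrices stored as lists of lists."""
    n = len(A)
    C = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            for k in range(n):
                C[i][j] += A[i][k] * B[k][j]
    return C


def matmul_tiled(A, B, tile=4):
    """Same product, computed tile by tile so that loaded data is reused.

    `tile` is an assumed tuning parameter; a real implementation would pick
    it so that three tile-sized blocks fit in cache together.
    """
    n = len(A)
    C = [[0.0] * n for _ in range(n)]
    for ii in range(0, n, tile):
        for kk in range(0, n, tile):
            for jj in range(0, n, tile):
                # Multiply one block of A by one block of B into a block of C.
                for i in range(ii, min(ii + tile, n)):
                    for k in range(kk, min(kk + tile, n)):
                        a = A[i][k]
                        for j in range(jj, min(jj + tile, n)):
                            C[i][j] += a * B[k][j]
    return C
```

Both functions perform the same multiplications and additions; only the order in which memory is touched differs.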

Beyond the tailoring of software to hard- 
ware is the question of bloat: Where does 
software bloat come from? Certainly, some 
bloat comes from trading off efficiency for 
other desirable traits, such as coding ease, as 
versions 1 to 3 of the matrix-multiplication 
code illustrate. Bloat also comes from a failure 
to tailor code to the underlying architecture, 
as versions 4 to 7 show. But much software 
bloat arises from software-development strat- 
egies (13, 14), such as reduction. 

The idea of reduction is this. Imagine that 
you are a programmer who has been given a 
problem A to solve (for example, distinguish- 
ing between a yes or no spoken response). 
You could write specialized code to solve A 
directly, but instead, you might notice that 
a related problem B has already been solved 
(existing speech-recognition software that 
understands many words, including yes and 
no). It will take you far less effort to solve A 
by converting it into a problem that can be 
solved with the existing code for B, that is, by 
reducing A to B. 
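
A deliberately tiny sketch of a reduction, under assumptions that are not in the source: take problem A to be "find the smallest element of a list" and problem B to be "sort a list." The hypothetical function `smallest_by_reduction` reuses Python's existing `sorted`, whereas `smallest_direct` is specialized code written for A alone.

```python
def smallest_by_reduction(values):
    """Solve A (find the minimum) by reducing it to B (sorting)."""
    # Easy to write, but does O(n log n) work and copies the whole input.
    return sorted(values)[0]


def smallest_direct(values):
    """Solve A with specialized code: a single pass and no copy."""
    it = iter(values)
    best = next(it)
    for v in it:
        if v < best:
            best = v
    return best
```

The reduction takes one line to write; the direct version takes more code but does less work per call.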

Inefficiencies can arise both from the re- 
duction itself (translating A to B) and from 
the generality of B (the solution to B is not 
tailored specifically to A). But the largest bloat 
arises from the compounding of reductions: 
reducing A to B, B to C, C to D, and so on. 
Even if each reduction achieves an impressive 
80% efficiency, a sequence of two independent 
reductions achieves just 80% × 80% = 64%. 
Compounding 20 more times yields an effi- 
ciency of less than 1%, or 100× in bloat. 
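
The arithmetic can be checked with a short sketch. It reads "20 more times" as 22 reductions in total, which is an interpretation rather than something the text states outright, and the function name is hypothetical.

```python
def compounded_efficiency(per_step, steps):
    """Efficiency remaining after `steps` independent reductions."""
    return per_step ** steps


two = compounded_efficiency(0.80, 2)     # two reductions: about 0.64
many = compounded_efficiency(0.80, 22)   # two reductions plus 20 more
bloat = 1.0 / many                       # slowdown factor relative to ideal
```

Under this reading, `many` falls below 0.01 and `bloat` exceeds 100, consistent with the figures quoted above.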

Because of the accumulated bloat created by 
years of reductionist design during the Moore 
era, there are great opportunities to make pro- 
grams run faster. Unfortunately, directly solving 
problem A using specialized software requires 
expertise both in the domain of A and in per- 
formance engineering, which makes the pro- 
cess more costly and risky than simply using 
reductions. The resulting specialized software 
to solve A is often more complex than the soft- 
ware that reduces A to B. For example, the fully 
optimized code in Table 1 (version 7) is more 
than 20 times longer than the source code for 
the original Python version (version 1). 

Indeed, simple code tends to be slow, and 
fast code tends to be complicated. To create a 
world where it is easy to write fast code, appli- 
cation programmers must be equipped with the 
knowledge and skills to performance-engineer 
their code, and productivity tools to assist must 
be improved considerably. 

Abstractly, software performance engineer- 
ing can be viewed as a simple process involv- 
ing a single loop: (i) Measure the performance 
of program A. (ii) Make a change to program A 
to produce a hopefully faster program A′. (iii) 
Measure the performance of program A′. (iv) If 
A′ beats A, set A = A′. (v) If A is still not fast 
enough, go to (ii). But today, our systems are 
sufficiently complicated that measurements 
must often be repeated many times to develop 
confidence that one version of a program 
outperforms another. 
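
The loop in steps (i) to (v), together with repeated measurement, can be sketched as follows. This is an illustrative outline, not a tool described in the source: the names `median_runtime` and `performance_engineer`, the use of the median over five runs, and the idea of passing in a list of already modified candidate programs are all assumptions.

```python
import statistics
import time


def median_runtime(func, repeats=5):
    """Median wall-clock time of func() over several runs, to damp noise."""
    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        func()
        samples.append(time.perf_counter() - start)
    return statistics.median(samples)


def performance_engineer(program, candidates, fast_enough, measure=median_runtime):
    """Keep a changed program only if it measures faster than the current best.

    `candidates` is a hypothetical iterable of modified programs (step ii);
    `fast_enough` is a target running time in the units `measure` returns.
    """
    best, best_time = program, measure(program)       # (i)
    for changed in candidates:                        # (ii)
        if best_time <= fast_enough:                  # (v) stop when fast enough
            break
        changed_time = measure(changed)               # (iii)
        if changed_time < best_time:                  # (iv)
            best, best_time = changed, changed_time
    return best, best_time
```

Making `measure` a parameter keeps the loop itself independent of how timing is done, so a noisier system can substitute more repetitions or a statistical test.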

As hardware becomes increasingly special- 
ized and heterogeneous (see the Hardware ar- 
chitecture section), high-performing code will 
become even more difficult to write. Because 
faster software will be increasingly important 
for better performance in the post-Moore era, 
however, the computing industry, researchers, 
and government should all be well motivated to 
develop performance-engineering technologies. 


Algorithms 


Algorithmic advances have already made many 
contributions to performance growth and will 
continue to do so in the future. A major goal 
is to solve a problem with less computational 
work. For example, by using Strassen’s algo- 
rithm (15) for matrix multiplication, the highly 
optimized code in version 7 of Table 1 can be 
improved by about 10%. For some problems, 
the gains can be much more impressive: The 
President’s Council of Advisors on Science 
and Technology concluded in 2010 that “per- 
formance gains due to improvements in algo- 
rithms have vastly exceeded even the dramatic 
performance gains due to increased processor 
speed” (emphasis theirs) [(16), p. 71]. 

Because algorithm design requires human 
ingenuity, however, it is hard to anticipate ad- 
vances. To illustrate the nature of algorithmic 
progress, consider the classical operations- 
research problem of finding the maximum 
flow in a network [(17), chap. 26], which can 
be used to model the movement of traffic in 
a road network, blood through the circula- 
tory system, or electricity in a circuit. Linear 
programming is a straightforward way to solve 
the maximum-flow problem, but the 20 years 
between 1975 and 1995 saw a sequence of 
algorithmic innovations that greatly improved 
on it. 
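
To make the problem concrete, here is a minimal Python sketch of one classical approach, the Edmonds-Karp variant of the augmenting-path method, which repeatedly uses breadth-first search to find a shortest path with spare capacity. It is offered only as an illustration of the maximum-flow problem and is not claimed to be one of the innovations discussed here; the function name and the dictionary-based graph format are assumptions.

```python
from collections import deque


def max_flow(capacity, source, sink):
    """Edmonds-Karp maximum flow; assumes source != sink.

    `capacity` maps (u, v) edge tuples to nonnegative capacities.
    """
    # Residual capacities, including zero-capacity reverse edges.
    residual = {}
    neighbors = {}
    for (u, v), c in capacity.items():
        residual[(u, v)] = residual.get((u, v), 0) + c
        residual.setdefault((v, u), 0)
        neighbors.setdefault(u, set()).add(v)
        neighbors.setdefault(v, set()).add(u)

    total = 0
    while True:
        # Breadth-first search for a shortest augmenting path.
        parent = {source: None}
        queue = deque([source])
        while queue and sink not in parent:
            u = queue.popleft()
            for v in neighbors.get(u, ()):
                if v not in parent and residual[(u, v)] > 0:
                    parent[v] = u
                    queue.append(v)
        if sink not in parent:
            return total  # no augmenting path remains

        # Find the bottleneck along the path, then push that much flow.
        bottleneck = float("inf")
        v = sink
        while parent[v] is not None:
            bottleneck = min(bottleneck, residual[(parent[v], v)])
            v = parent[v]
        v = sink
        while parent[v] is not None:
            u = parent[v]
            residual[(u, v)] -= bottleneck
            residual[(v, u)] += bottleneck
            v = u
        total += bottleneck
```

For example, `max_flow({("s", "a"): 3, ("a", "t"): 2}, "s", "t")` is limited by the second edge.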

Figure 1 shows the progress in maximum- 
flow algorithms over time. The performance 
gain of the best algorithm has rivaled the gain 
due to Moore’s law over the 38 years of the 
data (just over four orders of magnitude versus 
just under five), even though no new algo- 
rithm has improved the performance of this 
particular problem over the past 20 years. This 
example highlights three salient observations 
about algorithms: (i) Progress on a given al- 
gorithmic problem occurs unevenly and spo- 
radically. (ii) The benefits from algorithmic 
innovation can rival gains from Moore’s law. 
(iii) The algorithmic improvements in solving 
any given problem must eventually diminish. 

Fig. 1. Major algorithmic advances in solving the maximum-flow problem on a graph with n = 10^12 
vertices and m = n^1.1 edges. The vertical axis shows how many problems (normalized to the year 1978) could 
theoretically be solved in a fixed time on the best microprocessor system available in that year. Each major algorithm 
is shown as a circle in the year of its invention, except the first, which was the best algorithm in 1978. The dotted 
trajectory shows how faster computers [as measured by SPECint scores in Stanford's CPU database (56)] make 
each algorithm faster over time. The solid black line shows the best algorithm using the best computer at any point in 
time. Each algorithm's performance is measured by its asymptotic complexity, as described in the Methods. 

Because this example focuses on a well-known 
problem, however, it misses a key aspect of 
how algorithmic performance engineering can 
speed up computing: by providing efficient 
solutions to new problems. For example, more 
than a quarter of the 146 papers at the 2016 
Association for Computing Machinery (ACM) 
Symposium on Discrete Algorithms focus on 
problems that had not previously been studied 
algorithmically. Consequently, although re- 
search on old problems may still yield marginal 
gains, much of the progress in algorithms will 
come from three sources: (i) attacking new 
problem domains, (ii) addressing scalability 
concerns, and (iii) tailoring algorithms to take 
advantage of modern hardware. We discuss 
each source in turn. 

New problem domains continually create 
a need for new algorithms. Domains such as 
machine learning, social networks, security, 
robotics, game theory, sensor networks, and 
video coding were tiny or nonexistent 30 years 
ago but are now economically important enough 
to demand efficient algorithms. Many compa- 
nies have gained a competitive advantage thanks 
to algorithms. Google’s PageRank algorithm 
(18) made its World Wide Web search superior, 
and the auction algorithms of Google AdWords 
(19), which allow advertisers to bid for display 
space based on users’ search terms, made it 
highly profitable. Content-delivery networks, 
which delivered more than half of internet 
traffic in 2016 (20), depend on efficient algo- 
rithms to avoid congestion. Many sciences 
also depend on good algorithms. For exam- 
ple, DNA sequencing in computational biology 
depends on efficient dynamic-programming 
algorithms (21). 

Moore’s law has enabled today’s high-end 
computers to store over a terabyte of data in 
main memory, and because problem sizes 
have grown correspondingly, efficient algo- 
rithms are needed to make solutions afford- 
able. Sublinear algorithms (22, 23) provide one 
example of how to address problems of scale. 
For instance, to find the median of a trillion 
numbers, it takes at least a trillion memory 
operations just to read the input data. But 
many problems do not need the exact median 
and work perfectly well with only a good es- 
timate of the median. For these problems, we 
can instead extract a random sample of, say, 
a million numbers and compute the median 
of that sample. The result is a highly accu- 
rate estimate of the true median that can be 
computed a million times faster. The field of 
algorithms is full of strategies for dealing 
with scalability. 
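
The sampling idea can be sketched in a few lines of Python. The sizes below are scaled far down from a trillion so the sketch runs quickly, and the function name, the fixed seed, and the uniform test data are assumptions chosen for illustration. The exact median is computed here only to show how close the estimate lands.

```python
import random
import statistics


def estimate_median(data, sample_size, rng=random):
    """Approximate the median of `data` from a uniform random sample."""
    sample = rng.sample(data, sample_size)
    return statistics.median(sample)


rng = random.Random(0)  # fixed seed so the sketch is repeatable
data = [rng.random() for _ in range(200_000)]
approx = estimate_median(data, 10_000, rng)
exact = statistics.median(data)  # reads every element; for comparison only
```

The estimate touches only `sample_size` elements, so its cost does not grow with the size of the input.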

Tailoring an algorithm to exploit modern 
hardware can make it much faster (Table 1). 
Nevertheless, most algorithms today are still 
designed using the serial random-access ma- 
chine model (24) originally developed in the 
1960s and 1970s, which assumes that a pro- 
cessor can do only one operation at a time 
and that the cost to access any part of the 
memory is the same. Such algorithms often 
use modern hardware inefficiently because 
they underutilize the machine’s many parallel- 
processing cores and vector units, each of 
which can perform many operations per clock 
cycle, and they fail to exploit caching, which 
can speed up data accesses by two orders of 
magnitude. 

Although algorithms research has devel- 
oped mathematical models for salient features 
of modern computers, such as parallel and 
vector processing (25-32) and cache hierar- 
chies (33-35), a substantial gap between al- 
gorithm and implementation remains. Part 
of the problem is that each model tends to 
address just one aspect—such as parallelism, 
vector units, or caching—and yet tailoring an 
algorithm to a modern computer requires an 
understanding of all of them. Moreover, in 
an effort to gain every bit of performance, some 
hardware features—such as simultaneous mul- 
tithreading, dynamic voltage and frequency 
scaling, direct-mapped caches, and various 
special-purpose instructions—actually make 
it more difficult to tailor algorithms to hard- 
ware, because they cause variability and un- 
predictability that simple theoretical models 
cannot easily capture. 

One possible solution is autotuning (36, 37), 
which searches a parametrized space of pos- 
sible implementations to find the fastest 
one. With modern machine learning, it may 
even be possible to include implementations 
that differ by more than the values of a few 
parameters. Unfortunately, autotuning and 
machine learning tend to be too time con- 
suming to ask that every algorithm incur this 
large up-front cost. Furthermore, these ap- 
proaches actually make algorithm design harder, 
because the designer cannot easily understand 
the ramifications of a design choice. In the post- 
Moore era, it will be essential for algorithm 
designers and hardware architects to work 
together to find simple abstractions that de- 
signers can understand and that architects 
can implement efficiently. 
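
A minimal sketch of autotuning as exhaustive search over a parametrized space follows. The hooks `build` and `cost`, the parameter names in the usage note, and the function name `autotune` are hypothetical; real autotuners use smarter search strategies than trying every combination.

```python
import itertools


def autotune(build, parameter_space, cost):
    """Search a parametrized space of implementations for the cheapest one.

    build(params) returns an implementation; cost(impl) returns its measured
    running time. Both are hypothetical hooks supplied by the caller.
    """
    names = sorted(parameter_space)
    best_params, best_cost = None, float("inf")
    for values in itertools.product(*(parameter_space[n] for n in names)):
        params = dict(zip(names, values))
        c = cost(build(params))
        if c < best_cost:
            best_params, best_cost = params, c
    return best_params, best_cost
```

With a space such as `{"tile": [8, 16, 32, 64], "unroll": [1, 2, 4]}`, the search builds and measures 12 variants. The count multiplies with every added parameter, which is one way to see the up-front cost mentioned above.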


Hardware architecture 


Historically, computer architects used more 
and more transistors to make serial computa- 
tions run faster, vastly increasing the com- 
plexity of processing cores, even though gains 
in performance suffered from diminishing 
returns over time (38). We argue that in the 
post-Moore era, architects will need to adopt 
the opposite strategy and focus on hardware 
streamlining: implementing hardware func- 
tions using fewer transistors and less sili- 
con area. 

As we shall see, the primary advantage of 
hardware streamlining comes from provid- 
ing additional chip area for more circuitry 
to operate in parallel. Thus, the greatest ben- 
efit accrues to applications that have ample 
parallelism. Indeed, the performance of hard- 
ware for applications without much parallel- 
ism has already stagnated. But there is plenty 
of parallelism in many emerging application 
domains, such as machine learning, graphics, 
video and image processing, sensory comput- 
ing, and signal processing. Computer architects 
should be able to design streamlined archi- 
tectures to provide increasing performance for 
these and other domains for many years after 
Moore’s law ends. 

We can use historical data to observe the 
trend of architectural reliance on parallel- 
ism. Figure 2 plots three sets of benchmark 
data for microprocessors: SPECint performance 
(black squares and gray diamonds), SPECint- 
rate performance (black, orange, blue, and 
red squares), and microprocessor clock fre- 
quency (green dots). As the green dots in the 
figure show, clock speed increased by a fac- 
tor of more than 200 from 1985 to 2005, when 
it plateaued owing to the end of Dennard 
scaling, which we shall discuss shortly. Driven 
by increasing clock speed and other architec- 


tural changes during the Dennard-scaling era, 
microprocessor performance rapidly improved, 
as measured by the SPECint and SPECint- 
rate benchmarks (black squares), which aim 
to model computer performance on typical 
user workloads (39). The SPECint benchmark 
consists of mostly serial code, whereas the 
SPECint-rate benchmark is parallel. The two 
benchmarks perform the same on single- 
processor computers. But after 2004, as ma- 
chines added multiple cores and other forms 
of explicit parallelism, the two diverge. In- 
deed, the performance of parallel applications 
on the best-performing chips in each year 
(colored squares) grew by a factor of 30 from 
2004 to 2015, improving on average by about 
a factor of two every 2 years. By contrast, over 
the same time period, the largely serial SPECint 
benchmark (gray diamonds) scaled up by only 
a factor of three. 
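
The growth rates quoted above can be turned into doubling times with a short calculation. The sketch assumes steady exponential growth over the 11 years from 2004 to 2015, and the function name is hypothetical.

```python
import math


def doubling_time(growth_factor, years):
    """Years needed to double, assuming steady exponential growth."""
    return years / math.log2(growth_factor)


parallel_doubling = doubling_time(30, 11)  # SPECint-rate: 30x over 11 years
serial_doubling = doubling_time(3, 11)     # SPECint: 3x over 11 years
```

Under that assumption the parallel benchmark doubles a little more often than every two and a half years, whereas the largely serial benchmark needs close to seven years to double.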

Besides parallelism, an application needs 
locality to benefit from streamlining. As an ex- 
ample, when data are transferred from external 
dynamic random-access memory (DRAM) 
chips to a processing chip, they should 
be used multiple times before being transferred 
back. For an application with little locality, 
increasing parallelism causes traffic to off- 
chip memory to increase proportionally and 
eventually exceed the bandwidth of its chan- 
nel to memory, so that the application is 
memory bound. For an application with good 
locality, however, as parallelism increases, the 
amount of off-chip memory traffic increases 
much more slowly, enabling all the chip’s com- 
puting engines to do useful work without 
idling. Fortunately, many important applica- 
tion domains contain plenty of both locality 
and parallelism. 

Fig. 2. SPECint (largely serial) performance, SPECint-rate (parallel) performance, and clock-frequency 
scaling for microprocessors from 1985 to 2015, normalized to the Intel 80386 DX microprocessor in 
1985. Microprocessors and their clock frequencies were obtained from the Stanford CPU database (56). 
Microprocessor performance is measured in terms of scaled performance scores on the SPECint and 
SPECint-rate performance benchmarks obtained from (39). (See Methods for details.) Black squares identify 
single-core processors, for which SPECint and SPECint-rate benchmark performances are the same. 
Orange, blue, and red squares plot the SPECint-rate benchmark performance of various multicore processors, 
where orange squares identify processors with two to three cores, blue squares identify processors with four 
to seven cores, and red squares identify processors with eight or more cores. The gray diamonds plot the 
SPECint benchmark performance on multicore processors. The round green dots plot processor clock frequencies 
(also normalized to the Intel 80386). The gray background highlights the Dennard-scaling era (nominally up 
to 2004), and the white background highlights the multicore era (beyond 2004). 

Hardware streamlining can exploit locality 
in other ways, especially for domain-specific 
processors, which we shall discuss shortly. 
For example, explicit data orchestration (40) 
exploits locality to increase the efficiency with 
which data are moved throughout the mem- 
ory hierarchy [(41), chap. 4]. On-chip inter- 
connects can become simpler and consume 
less power and area if the application using 
them contains locality. For example, systolic 
arrays (42) can perform matrix computa- 
tions more efficiently using an area-efficient 
mesh interconnect than a general-purpose 
interconnect. 

Although hardware will increase in capabil- 
ity because of streamlining, we do not think 
that average clock speed will increase after 
Moore’s law ends, and it may in fact diminish 
slightly. Figure 2 shows that clock speed pla- 
teaued in 2005, when microprocessor design 
became power constrained. Before 2004, com- 
puter architects found ways to increase clock 
frequency without hitting hard power limits. 
Dennard scaling—reducing voltage as clock 
frequency increased—allowed processors to 
run faster without increasing power usage. 
(In practice, processor manufacturers often 
increased clock frequency without reducing 
voltage proportionally, which did increase 
chip power.) Since 2004, however, Moore’s 
law has provided many more transistors per 
chip, but because the ability to power them 
has not grown appreciably (43), architects 
have been forced to innovate just to prevent 
clock rates from falling. Slightly lower clock 
frequency and supply voltage reduce the 
power per transistor enough that substan- 
tially more circuitry can run in parallel. If the 
workload has enough parallelism, the added 
computing more than compensates for the 
slower clock. Serial applications may see some- 
what worse performance, but cleverness can 
reduce this cost. For example, Intel’s “Turbo” 
mode [(41), p. 28] runs the clock faster when 
fewer cores are active. (Other techniques to 
reduce transistor switching include using more 
transistors in caches, power-gating unused cir- 
cuitry, and minimizing signal switching.) 
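
The trade between clock rate and parallel circuitry can be illustrated with the standard first-order model of dynamic power, in which power is proportional to capacitance times voltage squared times frequency. That model, the 15% scaling figure in the usage note, and the function names are assumptions introduced here for illustration; they are not taken from the source, and the model ignores leakage power.

```python
def relative_dynamic_power(voltage_scale, frequency_scale):
    """Dynamic power relative to baseline under the P ~ C * V^2 * f model."""
    return voltage_scale ** 2 * frequency_scale


def parallel_throughput_gain(voltage_scale, frequency_scale):
    """Throughput gain at a fixed power budget for a fully parallel workload."""
    per_unit_power = relative_dynamic_power(voltage_scale, frequency_scale)
    extra_circuitry = 1.0 / per_unit_power     # how much more logic fits the budget
    return extra_circuitry * frequency_scale   # each unit runs at the slower clock
```

For instance, lowering both voltage and frequency by 15% gives `relative_dynamic_power(0.85, 0.85)` of about 0.61, so roughly 1.6 times as much circuitry fits in the same power budget, and `parallel_throughput_gain(0.85, 0.85)` comes to about 1.38 despite the slower clock.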

Now that designers have embraced paral- 
lelism, the main question will be how to 
streamline processors to exploit application 
parallelism. We expect two strategies to domi- 
nate: processor simplification and domain 
specialization. 

Processor simplification (44) replaces a 
complex processing core with a simpler core 
that requires fewer transistors. A modern 
core contains many expensive mechanisms 
to make serial instruction streams run faster, 
such as speculative execution [(41), section 
3.6], where the hardware guesses and pur- 
sues future paths of code execution, aborting 
and reexecuting if the guess is wrong. If a 
core can be simplified to occupy, say, half as 
many transistors, then twice as many cores 
can fit on the chip. For this trade-off to be 
worthwhile, the workload must have enough 
parallelism that the additional cores are kept 
busy, and the two simplified cores must do 
more useful computing than the single com- 
plex one. 
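
One way to reason about this trade-off is with Amdahl's law, which is brought in here as a modeling assumption rather than taken from the source. In the sketch below the complex core has speed 1.0, each simplified core has a lower speed, and the hypothetical function `simplified_speedup` reports how the simplified cores compare on a workload with a given parallel fraction.

```python
def simplified_speedup(parallel_fraction, simple_core_speed, cores=2):
    """Speedup of `cores` simple cores over one complex core of speed 1.0.

    Uses Amdahl's law: the serial part runs on one simple core, and the
    parallel part is divided evenly across all of them.
    """
    serial = (1.0 - parallel_fraction) / simple_core_speed
    parallel = parallel_fraction / (cores * simple_core_speed)
    return 1.0 / (serial + parallel)
```

With simple cores at 70% of the complex core's speed (an illustrative figure), a workload that is 90% parallel runs about 1.27 times faster on two simple cores, whereas one that is only 30% parallel runs slower, at about 0.82 times the original speed.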

Domain specialization (11, 43, 45) may be 
even more important than simplification. 
Hardware that is customized for an appli- 
cation domain can be much more streamlined 
and use many fewer transistors, enabling ap- 
plications to run tens to hundreds of times 
faster (46). Perhaps the best example today is 
the graphics-processing unit (GPU) [(41), sec- 
tion 4.4], which contains many parallel “lanes” 
with streamlined processors specialized to 
computer graphics. GPUs deliver much more 
performance on graphics computations, even 
though their clocks are slower, because they 
can exploit much more parallelism. GPU logic 
integrated into laptop microprocessors grew 
from 15 to 25% of chip area in 2010 to more 
than 40% by 2017 (see Methods), which shows 
the importance of GPU accelerators. Moreover, 
according to the Top 500 website, which tracks 
high-performance computing technology, only 
about 10% of supercomputers that were added 
to the Top 500 list in the year 2012 contained 
accelerators (often GPUs), but by 2017, that 
share had grown to 38% (12). Hennessy and 
Patterson (45) foresee a move from general- 
purpose toward domain-specific architectures 
that run small compute-intensive kernels of 
larger systems for tasks such as object recogni- 
tion or speech understanding. The key require- 
ment is that the most expensive computations 
in the application domain have plenty of par- 
allelism and locality. 

A specialized processor is often first imple- 
mented as an attached device to a general- 
purpose processor. But the forces that encourage 
specialization must be balanced with the forces 
that demand broadening: expanding the func- 
tionality of the specialized processor to make 
it more autonomous from the general-purpose 
processor and more widely useful to other 
application domains (47). 

The evolution of GPUs demonstrates this 
trade-off. GPUs were originally developed spe- 
cifically for rendering graphics, and as a result, 
GPUs are next to useless for many other com- 
putational tasks, such as compiling computer 
programs or running an operating system. But 
GPUs have nevertheless broadened to be handy 
for a variety of nongraphical tasks, such as lin- 
ear algebra. Consider the matrix-multiplication 
problem from the Software section. An Advanced 
Micro Devices (AMD) FirePro S9150 GPU (48) 
can produce the result in only 70 ms, which 
is 5.4 times faster than the optimized code 
(version 7) and a whopping 360,000 times faster 
than the original Python code (version 1). 

As another example of the trade-off be- 
tween broadening and specialization, GPUs 
were crucial to the “deep-learning” revolution 
(49), because they were capable of training 
large neural networks that general-purpose 
processors could not train (50, 51) fast enough. 
But specialization has also succeeded. Google 
has developed a tensor-processing unit (TPU) 
(52) specifically designed for deep learning, 
embracing special-purpose processing and 
eschewing the broader functionality of GPUs. 

During the Moore era, specialization usually 
yielded to broadening, because the return on 
investment for developing a special-purpose 
device had to be amortized by sales over the 
limited time before Moore’s law produced a 
general-purpose processor that performs just 
as well. In the post-Moore era, however, we 
expect to see more special-purpose devices, 
because they will not have comparably per- 
forming general-purpose processors right 
around the corner to compete with. We also 
expect a diversity of hardware accelerators 
specialized for different application domains, 
as well as hybrid specialization, where a single 
device is tailored for more than one domain, 
such as both image processing and machine 
learning for self-driving vehicles (53). Cloud 
computing will encourage this diversity by ag- 
gregating demand across users (12). 


Big components 


In the post-Moore era, performance engineer- 
ing, development of algorithms, and hardware 
streamlining will be most effective within big 
system components (54). A big component is 
reusable software with typically more than a 
million lines of code, hardware of compara- 
ble complexity, or a similarly large software- 
hardware hybrid. This section discusses the 
technical and economic reasons why big com- 
ponents are a fertile ground for obtaining 
performance at the Top. 

Changes to a system can proceed without 
much coordination among engineers, as long 
as the changes do not interfere with one an- 
other. Breaking code into modules and hid- 
ing its implementation behind an interface 
make development faster and software more 
robust (55). Modularity aids performance en- 
gineering, because it means that code within 
a module can be improved without requir- 
ing the rest of the system to adapt. Likewise, 
modularity aids in hardware streamlining, 
because the hardware can be restructured 
without affecting application programming
interfaces (APIs). Performance engineering 
and hardware streamlining that do not re- 
quire coordination already occur this way, 
and we expect them to continue to do so. 
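
The point about modularity can be illustrated with a minimal Python sketch. It is not taken from the article: the names `Summer`, `LoopSummer`, `BuiltinSummer`, and `mean` are hypothetical, chosen only to show client code that depends on an interface while the implementation behind it is replaced.

```python
from typing import Protocol, Sequence


class Summer(Protocol):
    """Interface: callers depend only on this method signature."""

    def total(self, values: Sequence[float]) -> float: ...


class LoopSummer:
    """Straightforward first implementation (hypothetical)."""

    def total(self, values: Sequence[float]) -> float:
        result = 0.0
        for value in values:
            result += value
        return result


class BuiltinSummer:
    """Reengineered implementation: same interface, different internals."""

    def total(self, values: Sequence[float]) -> float:
        return float(sum(values))


def mean(summer: Summer, values: Sequence[float]) -> float:
    # Client code is written once against the interface and does not change
    # when the module behind the interface is swapped.
    return summer.total(values) / len(values)
```

Because `mean` touches only the `total` method, either class can be passed in without coordinating with whoever maintains `mean`, which is the kind of local change the paragraph above describes.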

Many of the most valuable future opportu- 
nities for improving performance will not 
arise locally within a single module, however, 
but from broad-based and systematic changes 
across many modules affecting major parts of 
a system. For example, because so much soft- 
ware is reductionist, the larger a system, the 
greater the opportunity for huge performance 
gains when layer upon layer of reductions are 
replaced by a lean and direct implementa- 
tion. But large-scale reengineering that affects 
many modules requires the coordination of 
many engineers, which is costly and risky. 

Big system components (web browsers, data- 
base systems, operating systems, micropro- 
cessors, GPUs, compilers, disk-storage systems, 
and so on) offer both technical opportunities 
for making applications run fast and a con- 
text where it is economically viable to take ad- 
vantage of them. As an example of the types of 
changes that can be made within a big compo- 
nent, consider a microprocessor. An instruction- 
set architecture (ISA) is the interface by which 
software tells the processor what to do. Man- 
ufacturers routinely make major internal changes 
to improve performance without changing 
the ISA so that old software continues to run 
correctly. For example, the Intel 8086 released 
in 1978 had 29,000 transistors (56), whereas 
the 22-core Xeon E5-2699 v4, released in 2016, 
has about 248,000 times more transistors (57) 
that produce more than a million times better 
performance, according to SPECint rate (39). 
In that time, the ISA has grown by less than 
a factor of four (58), and old programs con- 
tinue to work even as the interface adds new 
functionality. 
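
As a quick check of the arithmetic in this example, the figures quoted above can be multiplied out. The sketch below uses only numbers stated in the text; the variable names are illustrative.

```python
TRANSISTORS_8086 = 29_000   # Intel 8086 (1978), as stated in the text
TRANSISTOR_RATIO = 248_000  # "about 248,000 times more transistors"
ISA_GROWTH_BOUND = 4        # the ISA grew by "less than a factor of four"

# Transistor count implied for the 22-core Xeon E5-2699 v4.
xeon_transistors = TRANSISTORS_8086 * TRANSISTOR_RATIO
print(f"Implied Xeon transistor count: about {xeon_transistors:.2e}")

# Lower bound on how much faster the internals grew than the interface.
internals_vs_interface = TRANSISTOR_RATIO / ISA_GROWTH_BOUND
print(f"Internal growth exceeds ISA growth by more than {internals_vs_interface:,.0f}x")
```

The product is roughly 7.2 billion transistors, so the internals grew by a factor at least 62,000 times larger than the growth of the interface.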

Big components provide good opportunities 
for obtaining performance, but opportunity 
alone is not enough. To outweigh the costs 
and risks of making changes, an economic 
incentive must exist. If a commercial enter- 
prise or a nonprofit organization owns a big 
component, it can justify the investment to 
enhance performance because it reaps the 
benefit when the job is done. Single owner- 
ship also helps with coordination costs. If 
many parties must agree to make a change, 
but it takes only a few to veto it, then it can 
be hard for change to happen. Even the fear 
of high coordination costs can block change 
in a large codebase if too many parties are 
involved. But when a single entity owns a big 
component, it has the power to make vast 
changes and pool the costs and benefits. It 
can choose to reengineer as many modules 
as it can justify economically and coordinate 
with the outside world only at the big com- 
ponent’s interface. 

Conclusions

As miniaturization wanes, the silicon-fabrication 
improvements at the Bottom will no longer 
provide the predictable, broad-based gains in 
computer performance that society has enjoyed 
for more than 50 years. Performance engineering
of software, development of algorithms, and 
hardware streamlining at the Top can con- 
tinue to make computer applications faster in 
the post-Moore era, rivaling the gains accrued 
over many years by Moore’s law. Unlike the 
historical gains at the Bottom, however, the 
gains at the Top will be opportunistic, uneven, 
sporadic, and subject to diminishing returns 
as problems become better explored. But 
even where opportunities exist, it may be 
hard to exploit them if the necessary mod- 
ifications to a component require compatibil- 
ity with other components. Big components 
can allow their owners to capture the eco- 
nomic advantages from performance gains at 
the Top while minimizing external disruptions. 
In addition to the potential at the Top, nas- 
cent technologies—such as 3D stacking, quan- 
tum computing, photonics, superconducting 
circuits, neuromorphic computing, and graphene 
chips—might provide a boost from the Bottom. 
At the moment, these technologies are in their 
infancy and lack the maturity to compete with 
today’s silicon-based semiconductor technol- 
ogy in the near future (59). Although we ap- 
plaud investment in these technologies at the 
Bottom because of their long-term potential, 
we view it as far more likely that, at least in the 
near term, performance gains for most appli- 
cations will originate at the Top. 


Methods 
Table 1 


Each running time is the minimum of five runs 
on an Amazon AWS c4.8xlarge spot instance, 
a dual-socket Intel Xeon E5-2666 v3 system 
with a total of 60 gibibytes of memory. Each 
Xeon is a 2.9-GHz 18-core CPU with a shared 
25-mebibyte L3-cache. Each processor core 
has a 32-kibibyte (KiB) private L1-data-cache 
and a 256-KiB private L2-cache. The machine 
was running Fedora 22, using version 4.0.4 of 
the Linux kernel. The Python version was exe- 
cuted using Python 2.7.9. The Java version was 
compiled and run using OpenJDK version 
1.8.0_51. All other versions were compiled using 
GNU Compiler Collection (GCC) 5.2.1 20150826. 


Figure 1 


Each curve models how hardware improvements 
grow the performance of a fixed maximum- 
flow algorithm, starting from the year the al- 
gorithm was published. The performance of 
each algorithm was measured as the number 
of graphs of $n = 10^{12}$ vertices, $m = n^{1.1} \approx 15.8 \times 10^{12}$
edges, and 64-bit integer capacities that
can be solved in a fixed amount of time, nor- 
malized such that the first point starts at 1. We 
calculated the performance of each algorithm
based on its asymptotic complexity in terms of 
big-O notation, assuming that the constant 
hidden by the big-O is 1. We excluded approx- 
imate algorithms from consideration, because 
these algorithms do not necessarily return the 
same qualitative answer. We also excluded algo- 
rithms whose stated asymptotic bounds ignore 
logarithmic factors to simplify the performance 
comparisons. We identified four maximum- 
flow algorithms developed since 1978 that 
each delivered a performance improvement of 
more than a factor of four over its predecessor: 
Edmonds and Karp’s $O(m^2 \log U)$ algorithm (60);
Sleator and Tarjan’s $O(mn \log n)$ algorithm (61);
Ahuja, Orlin, and Tarjan’s $O(mn \log(n\sqrt{\log U}/m + 2))$
algorithm (62); and Goldberg and
Rao’s $O(n^{2/3} m \log(n^2/m) \log U)$ algorithm (63).
Each curve plots the best performance achieved 
by any processor up to that date, on the basis 
of scaled MIPS (million instructions per sec- 
ond) estimates and SPECint scores recorded in 
Stanford’s CPU database (56). These processor 
performance measurements are normalized 
using the same method as in Fig. 2. 
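
The modeling step described above can be sketched in a few lines of Python. This is an illustrative reconstruction, not the authors' code: it assumes base-2 logarithms, takes log U = 64 for 64-bit capacities, and sets the hidden constant of each bound to 1, as the text states. Function and variable names are hypothetical.

```python
import math

N = 10.0 ** 12   # vertices
M = N ** 1.1     # edges, about 15.8e12
LOG_U = 64.0     # log2 of the capacity bound (assumption: base-2 logs)


def edmonds_karp(n, m, log_u):
    return m ** 2 * log_u


def sleator_tarjan(n, m, log_u):
    return m * n * math.log2(n)


def ahuja_orlin_tarjan(n, m, log_u):
    return m * n * math.log2(n * math.sqrt(log_u) / m + 2)


def goldberg_rao(n, m, log_u):
    return n ** (2 / 3) * m * math.log2(n ** 2 / m) * log_u


ALGORITHMS = [
    ("Edmonds-Karp", edmonds_karp),
    ("Sleator-Tarjan", sleator_tarjan),
    ("Ahuja-Orlin-Tarjan", ahuja_orlin_tarjan),
    ("Goldberg-Rao", goldberg_rao),
]


def modeled_costs(n=N, m=M, log_u=LOG_U):
    """Operation count of each bound with the big-O constant taken as 1."""
    return [(name, bound(n, m, log_u)) for name, bound in ALGORITHMS]


def successive_speedups(costs):
    """Factor by which each algorithm improves on its predecessor."""
    return [costs[i][1] / costs[i + 1][1] for i in range(len(costs) - 1)]


if __name__ == "__main__":
    for name, cost in modeled_costs():
        print(f"{name:20s} {cost:.3e} modeled operations")
    print([round(s, 1) for s in successive_speedups(modeled_costs())])
```

Since performance is the number of graphs solved in a fixed time, it is inversely proportional to the modeled operation count, so each speedup factor is the vertical jump between consecutive curves at a fixed hardware generation. Under these assumptions every successive factor exceeds four, which matches the selection criterion stated in the text.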

Figure 1 ignores the constant factors in the 
performance of maximum-flow algorithms 
because comparably engineered implemen- 
tations of these algorithms are not available. 
Nevertheless, the effects from changes in con- 
stant factors between algorithms can be in- 
ferred from the plot. For example, if a constant 
factor were 10 times larger between one al- 
gorithm and a later one, then the curve for 
the later algorithm would be one order of 
magnitude lower (i.e., one division less on 
the y axis). 


Figure 2 


Performance measurements for different pro- 
cessors were gathered from Stanford’s CPU data- 
base (56). For a processor released before 1992, 
its performance is measured as the recorded 
estimate of how many MIPS that processor can 
perform. For subsequent processors, each per- 
formance measurement corresponds to the 
best recorded score for that processor on either 
the SPECint1992, SPECint1995, SPECint2000, 
or SPECint2006 benchmarks. Performance for 
multicore processors is measured using both 
standard SPECint scores (gray diamonds) and 
in terms of throughput (multicolored squares), 
using SPECint2006 rate scores. All SPECint 
performance scores were obtained from (39). 
The date for an Intel processor released after 
2004 is plotted as the first day in the quarter 
when that processor was released, unless a 
more precise release date was found. Scores 
for different SPECint benchmarks were nor- 
malized by scaling factors, which were com- 
puted from the geometric-mean ratios of scores 
for consecutive versions of SPECint for pro- 
cessors that were evaluated on both versions. 
The MIPS estimates were normalized with 
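
The benchmark-to-benchmark scaling described above can be sketched as follows. This is a minimal illustration, assuming that scores from an older benchmark version are multiplied by the geometric mean of the new/old score ratios over processors measured on both versions; the function names and the sample scores are hypothetical and are not data from the CPU database.

```python
import math


def geometric_mean_ratio(old_scores, new_scores):
    """Geometric mean of new/old score ratios over processors in both dicts."""
    shared = sorted(set(old_scores) & set(new_scores))
    if not shared:
        raise ValueError("no processor was evaluated on both benchmark versions")
    log_sum = sum(math.log(new_scores[p] / old_scores[p]) for p in shared)
    return math.exp(log_sum / len(shared))


def rescale(old_scores, factor):
    """Express old-benchmark scores on the scale of the newer benchmark."""
    return {processor: score * factor for processor, score in old_scores.items()}


if __name__ == "__main__":
    # Hypothetical scores, for illustration only.
    old = {"cpu_a": 10.0, "cpu_b": 20.0, "cpu_c": 5.0}
    new = {"cpu_b": 40.0, "cpu_c": 40.0, "cpu_d": 90.0}
    factor = geometric_mean_ratio(old, new)
    print(factor, rescale(old, factor))
```

A geometric mean is the natural choice for averaging ratios, because it treats a doubling and a halving symmetrically.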

INTRODUCTION: Plasmids are autonomously
replicating, nonessential DNA molecules that 
accelerate the evolution of many important 
bacterial-driven processes. For example, plas- 
mids spread antibiotic resistance genes, which 
are a pressing problem for human and animal 
health. Plasmids can also encode complex traits 
that allow bacteria to interact intimately with 
eukaryotes. Acquisition of an oncogenic tumor- 
inducing (Ti) or root-inducing (Ri) plasmid 
by saprophytic soil agrobacteria changes them 
into pathogens capable of genetically trans- 
forming and causing disease in a broad range 
of plant species. 

Plasmids are also biotechnology tools that 
can advance our understanding of life. They 
can be used to generate organisms with un- 
usual traits and innovative applications. The 
potential for using oncogenic plasmids to 
accelerate research was recognized early in 
their discovery. Along with strains of agro- 
bacteria, disarmed plasmids are mainstays 
as tools in plant biology and plant genetic 
engineering. 


RATIONALE: Inferring evolutionary relation- 
ships is foundational for classifying plas- 
mids, accurately assessing the influence of 
plasmids on disease outbreaks, developing 
appropriate strategies for mitigating disease, 
and expediting efforts to leverage plasmid 
diversity for biotechnology. However, such 
research is complicated because plasmids
consist of diverse structural variants and are
extraordinarily dynamic, modular molecules
that can be reshuffled and broadly transmitted
horizontally.

Combined genomic analyses of chromosomal and plasmid
identities to model disease spread. Genomic data from
hundreds of strains of agrobacteria were parsed
and analyzed to infer the evolutionary histories
of chromosomes and oncogenic Ti and Ri plasmids.
The data were overlaid to uncover the roles of bacteria
and plasmids in the global spread of disease.
Panel title: Ancestral relationships at the chromosomal level.

We focused on oncogenic plasmids of agro- 
bacteria because of their important roles in 
causing disease and as biotechnology tools. 
Two genomic datasets were developed. One 
consisted of diverse, broadly sampled his- 
torical strains and was intended to serve as 
the basis for an evolutionary framework. The 
other consisted of contemporaneous strains 
hierarchically sampled from managed plant 
production sites, for the purpose of cali- 
brating epidemiology methods. The datasets 
were combined to identify epidemiological 
patterns. 


RESULTS: We combined analyses of chromo- 
somal ancestry and plasmids to uncover their 
contributions and accurately model the global 
spread of disease. Phylogenetic, genomic, and 
time tree analyses of thousands of strains from 
the Rhizobiales order yielded a robust phylo- 
genetic history of agrobacteria. We developed 
a strategy that uses phylogenetic and network 
approaches as well as different scales of ge- 
netic information to infer the evolution of 
diverse oncogenic plasmids. By combining 
results, we uncovered global epidemiological 
patterns supporting movement of pathogens 
clonally and plasmids horizontally in space 
and time. 


Panel title: Plasmid identity, evolution, and modularity.

This study has three major findings: (i)
Lineages of agrobacteria emerged indepen- 
dently and at different times from within a 
genus-level group that also circumscribes 
multiple lineages of rhizobia. (ii) Agrobacterial
Ti and Ri plasmids are descended from
only six and three lineages (types), respectively.
Few evolutionary events are sufficient to explain
the relationships observed among types. Each type
is subject to different
pressures and shows different degrees of
within-group variation, but their evolution
is nonetheless guided by similar principles.
The extent of modularity is high, and genes
and functional modules are frequently reshuffled
via recombination within conserved
loci. Yet plasmid diversification is nonetheless
constrained by the spatial structure of
loci that interact genetically. (iii) Transmission
of oncogenic plasmids, especially within
agricultural settings, promotes the massive
spread of disease.

Read the full article at https://dx.doi.org/10.1126/science.aba5256


CONCLUSION: Our strategy for inferring the 
evolution and transmission of virulence plas- 
mids has potential applications in plant and 
human or animal health and food safety, as 
well as for understanding the ecology and 
evolution of other plasmid-mediated pro- 
cesses such as mutualistic symbioses. In ad- 
dition, this strategy can be applied to study 
other mobile and modular elements, such as 
integrative conjugative elements and path- 
ogenicity or symbiosis islands. We have shown 
that plasmids once viewed as too diverse to be 
classified have distinct lineages, and that ac- 
curate modeling of the spread of disease can 
be accomplished by robustly defining their 
evolutionary relationships. 

The accelerated evolution and spread of pathogens are threats to host species. Agrobacteria require
an oncogenic Ti or Ri plasmid to transfer genes into plants and cause disease. We developed
a strategy to characterize virulence plasmids and applied it to analyze hundreds of strains collected
between 1927 and 2017, on six continents and from more than 50 host species. In consideration of prior 
evidence for prolific recombination, it was surprising that oncogenic plasmids are descended from a 
few conserved lineages. Characterization of a hierarchy of features that promote or constrain plasticity 
allowed inference of the evolutionary history across the plasmid lineages. We uncovered epidemiological 
patterns that highlight the importance of plasmid transmission in pathogen diversification as well as in 
long-term persistence and the global spread of disease. 


Agricultural ecosystems promote the rapid
evolution and diversification of pathogens (1).
These ecosystems increase pathogen population
sizes, lower barriers for transmission, and increase
opportunities for horizontal exchange of viru- 
lence genes. Understanding the genetic basis 
for how pathogens emerge and diversify in 
agricultural ecosystems is foundational for 
determining their spread and assessing risks. 
Such knowledge is critical to policies for im- 
proving plant health and preparing against 
disease outbreaks to increase global food 
security (2). 

Plasmids that confer virulence and antimi- 
crobial resistance are evolutionary drivers of 
pathogenic bacteria that affect plant, human, 
and animal health (3-6). Some plasmids can 
mediate conjugation and transmit horizon- 
tally within and across species to diversify 
pathogen populations. The development of 
strategies to infer horizontal transfer of plas- 
mids is crucial for accurately assessing dis- 
ease outbreaks and instituting measures to 
limit risks from disease. However, it is dif- 
ficult to trace dissemination of plasmids and 
integrate findings with analyses of chromo- 
somes, because plasmids can be horizontally 
transferred and hence can defy inference of 
evolutionary histories (7). In addition, plas- 
mids tend to have few conserved regions, high 
numbers of repeated sequences, high rates 
of gene exchange, and many structural variants.
Even core genes such as those associated
with replication or mobility are susceptible 
to recombination. 

Agrobacteria are a diverse, polyphyletic group, 
and its members are common in soils and 
on many species of plants. These bacteria have 
multipartite genomes with two chromosomes 
and nonvirulence plasmids, which are extreme- 
ly diverse among agrobacteria (8). Pathogenic 
strains additionally carry conjugative onco- 
genic tumor-inducing (Ti) or root-inducing 
(Ri) plasmids and can infect plants to cause 
crown gall or hairy root diseases, respectively 
(9). These diseases are incurable and persist 
for the duration of the lives of infected plants, 
which continually release opines, metabolites 
that are central to the ecology and epidemi- 
ology of agrobacteria. 

Oncogenic plasmids are exceptionally well 
characterized because of their applications 
in plant biotechnology and hence are models 
for understanding the impact of plasmids on 
disease ecology (10, 11). Core to oncogenic 
plasmids are repABC replication genes, tra 
and trb interbacterial conjugation genes, and 
vir genes. Vir proteins are necessary to process 
and escort a region, the transfer DNA (T-DNA), 
of oncogenic plasmids into host cells, where 
it recombines into the genome to genetically 
transform host plants. T-DNAs include on- 
cogenes that reprogram transformed cells to 
proliferate. One oncogene present on T-DNAs 
in all oncogenic plasmids is tms1 (aux1/iaaM),
which is involved in the synthesis of auxins, a 
class of plant growth-promoting hormones 
(12). T-DNAs also include genes responsible 
for the synthesis of opines. More than 20 chemically
diverse opines have been identified among
oncogenic plasmids (13). Opines are nutrients
for the pathogen and act as signals that trigger
replication and interbacterial conjugation of
oncogenic plasmids. Genes necessary for uptake
and catabolism of opines are spatially separated
from cognate synthesis genes and are
located outside of T-DNAs. This arrangement
led to the hypothesis that instigating pathogens
are privileged in accessing the opines (14).

Beyond the core set of genes, oncogenic plas- 
mids vary extensively in composition and struc- 
ture (15). This diversity is a major challenge, 
best expressed by the sentiment that it is 
practically impossible to reconstruct the evo- 
lution of agrobacteria and oncogenic plasmids, 
even with the availability of an extraordinarily 
large genomic dataset (16). Here, we overcame 
this challenge and analyzed genomic data from 
hundreds of strains, and applied the results to 
infer the evolutionary history and transmis- 
sion of oncogenic plasmids. 


Chromosomal ancestry 


A robust phylogeny of bacteria is an essential 
foundation for understanding the evolution 
and transmission of the plasmids that they 
host. Hence, it was critical to reconcile contro- 
versies over the complex evolutionary rela- 
tionships among agrobacteria (see data S1 
for synonymous terms) (17, 18). Sequenced 
strains were reclassified in the context of a 
dataset of nearly 1500 additional strains that 
represent families within the order Rhizobiales 
(data S1 and S2). Combined phylogenetic and 
genomic analyses indicated that agrobacte- 
ria, Rhizobium, Allorhizobium, Neorhizobium, 
Ensifer/Sinorhizobium, Pararhizobium, and
Shinella are related at a level of a genus, re- 
ferred to as the agrobacteria-rhizobia complex 
(ARC; fig. S1 and data S3). Within the ARC, 
groups traditionally known as Agrobacterium 
tumefaciens, Agrobacterium rhizogenes, and 
Agrobacterium vitis are called biovar (BV) 1, 
BV2, and BV3, respectively (19). BV1 was prev- 
iously subclassified into genomospecies (20). 
Whole-genome analysis suggested that most 
genomospecies (excepting G7 and G8, which 
each represent three groups) are commensu- 
rate with species-level groups (data S4). BV2 is 
homogeneous and represents a single species- 
level group. BV2 is sister to a newly defined 
clade of pathogenic agrobacteria (BV2-like). 
BV3 forms multiple species-level groups. 

A single lineage of agrobacteria was previ- 
ously estimated to have diverged from rhizobia 
between ~250 million and ~150 million years 
ago (21, 22). Considering the new interpretation
of their phylogeny, we reevaluated estimates
of divergence times (Fig. 1, fig. S2, and
data S5). Lineages of agrobacteria emerged 
independently, at different times in the his- 
tory of the ARC, and are more closely related 
to lineages of rhizobia than to each other. BV1 
emerged 48 ± 10 million years ago, and its
clade has substructure with long branch lengths,
which suggests that species-level groups diverged
early in its history. The BV2 lineage is
estimated to have emerged only 1.6 ± 0.5 million
years ago, and its short branch lengths
and low genetic diversity are consistent with
the possibility that BV2 is recovering from a
recent bottleneck (fig. S2). In the time tree,
BV3 diverged independently from other lineages
of agrobacteria (Fig. 1). However, its relationships
in the time tree are incongruent
with those of the phylogenetic tree (fig. S1).

Fig. 1. Time-calibrated phylogenetic tree of the agrobacteria-rhizobia complex. Blue horizontal bars
indicate confidence intervals for each split. Clades I to IV are defined in fig. S1. Key groups are labeled;
three-letter codes are for select species of rhizobia or agrobacteria without biovar classifications: Run
(R. undicola), Ala (A. larrymoorei), Ask (A. skierniewicense), Aru (A. rubi), and Aar (A. arsenijevicii); synonyms
are listed in data S1. Each strain is colored according to its species-level classification. Two groups
(black bars) within BV1 are not associated with established genomospecies.


Plasmid identities 


We next addressed the difficult task of re- 
solving the relationships of the oncogenic 
plasmids. Because plasmid genes do not meet 
the assumptions of traditional phylogenetic 
methods, we used multiple approaches to com- 
pare and cross-validate findings. A total of 
143 oncogenic plasmid sequences were identified.
We analyzed different levels of genetic
features, including signatures of subsequences 
of length k (k-mers), a 43-core gene phylogeny, 
a phylogenetic network derived from 144 
single-copy genes present in at least 40% of 
all plasmids, and patterns of composition as 
well as organization. To our surprise, the re- 
sults were consistent and allowed us to cat- 
egorize the molecules into just nine distinct 
lineages of six Ti (type I-VI) and three Ri 
(type I-III) oncogenic plasmids (Figs. 2 and 3,
figs. S3 to S9, and data S6 and S7). Type I and 
type IV Ti plasmids were further divided into 
two and three subtypes, respectively. 
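
The k-mer comparison mentioned above can be illustrated with a small Python sketch. It is a simplification under stated assumptions, not the authors' pipeline: it uses exact k-mer sets rather than any sketching scheme, ignores reverse complements, and the function names, toy sequences, and threshold are hypothetical.

```python
def kmer_set(sequence, k):
    """All distinct subsequences of length k in a DNA string."""
    seq = sequence.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity: shared k-mers divided by all k-mers seen."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def similarity_edges(sequences, k, threshold):
    """Weighted undirected edges between sequences whose similarity >= threshold."""
    names = sorted(sequences)
    signatures = {name: kmer_set(sequences[name], k) for name in names}
    edges = []
    for i, left in enumerate(names):
        for right in names[i + 1:]:
            weight = jaccard(signatures[left], signatures[right])
            if weight >= threshold:
                edges.append((left, right, weight))
    return edges


if __name__ == "__main__":
    # Toy sequences, for illustration only.
    toy = {"p1": "ACGTACGT", "p2": "ACGTACGA", "p3": "TTTTTTTT"}
    print(similarity_edges(toy, k=3, threshold=0.5))
```

In such a network, plasmids that share most of their k-mers end up connected by heavily weighted edges, and groups with little shared content fall into separate graphs.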

In the k-mer network, most types segre- 
gated into distinct graphs (Fig. 2A). However, 
type IV.c Ti plasmids are high-degree nodes 
that connect type I Ti plasmids to the other 
subtypes of type IV Ti plasmids. In the phylogenetic
network, there were many splits, yet
plasmids clustered via short edges into mono- 
phyletic clades (Fig. 2B). This topology is 
consistent with extensive and ancient recom- 
bination events prior to divergence of plas- 
mid lineages. The two networks show that 
the type I Ti plasmids are most closely related 
to type IV plasmids, which is consistent with 
their similarities in structure and gene com- 
position (fig. S10 and data S8). Other relation- 
ships were also observed in the phylogenetic 
network. Type I Ti plasmids are related to the 
type VI Ti plasmid (Fig. 2B). Type II plasmids 
are nested within the type III Ti plasmids and 
are more closely related to each other than 
to other types of Ti plasmids. Type V Ti plas- 
mids are the most distinct of the sequenced 
Ti plasmids. Type II and type III Ri plasmids 
are more closely related to each other than to 
type I Ri plasmids. 

Within this sequenced dataset, there are 
relationships among plasmids, bacterial spe- 
cies, and plant hosts, some of which had not 
been previously noted and could potentially 
be of use in biotechnology applications of 
agrobacteria (Fig. 2C and fig. S3). BV1 has the
most diverse spectrum of oncogenic plasmids, 
with four types of Ti plasmids and type I Ri 
plasmids. Type III Ti plasmids are exclusively 
in BV1. BV2 tends toward having only one of
the two minor variants of type I.a Ti plasmids
(fig. S15A). However, some have type II and
type IV.c Ti plasmids as well as type III Ri 
plasmids. BV3 strains exclusively carry type 
IV.a, type IV.b, and type V Ti plasmids. Strains 
carrying type I Ti plasmids were isolated pre- 
dominately from woody plants, whereas those 
with type III Ti plasmids were exclusively from 
herbaceous plants. With the exception of only 
two strains, those belonging to BV2 were cul- 
tured from woody plants. BV3 strains were 
exclusively cultured from grapevine, which is 
also host to BV1 and BV2 strains. 

There are different degrees of variation with- 
in each type of oncogenic plasmid. We com- 
bined gene synteny, gene annotation, and 
sequence data for a comprehensive compar- 
ison of patterns of variation in plasmid types 
(Fig. 3A, figs. S5 to S9, and data S9). Type I.a 
and type II Ti plasmids are relatively con- 
served in gene composition and structure. 
Type I.b and type III Ti plasmids are more 
diverse, with gene presence/absence variation 
present in T-DNAs, opine-associated loci, and 
regions flanking the vir loci. Despite the lower 
sampling depth, a range in gene presence/ 
absence was also observed among the Ri plas- 
mids. Across all but type I and type V Ti plas- 
mids, higher variation in gene composition is 
present within T-DNAs and proximal to the 
right border. Variation extends to the region 
neighboring the right border of T-DNAs in 
type I.b and type III Ti plasmids and all types
of Ri plasmids. In these plasmids, the region
encodes proteins necessary for transport
and catabolism of opines. As previously
noted, noncore genes of most plasmid types
are highly variable. The tra and trb loci of
type I-III Ti plasmids have extensive small-scale
changes.

Fig. 2. Nine distinct lineages of oncogenic plasmid types. (A) Weighted undirected network of oncogenic
plasmids. Nodes represent individual oncogenic plasmids and are colored according to the type and subtype
of Ti (top) and Ri (bottom) plasmids. Darker edges indicate greater Jaccard similarity of k-mer signatures.
(B) A split network of the oncogenic plasmids. Branch thickness indicates relative support for the split.
concatenated sequences from 43 single-copy core genes (data S7). Tips (circles and triangles) are color-coded
according to plasmid type. Colored panels in the top row below the tree denote the type of plant from
which strains were cultured. Colored panels in the bottom row indicate the classification of the strains.
The same color scheme for plasmid types is used in each of the three panels. The tree is midpoint-rooted.
See fig. S3 for a more detailed and larger version of (C).
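
The gene synteny comparison used in this section can be sketched as a graph in which nodes are genes and two genes are connected if they are adjacent in at least one plasmid. The Python below is a minimal illustration under that definition, treating each plasmid as a circular list of genes; the function names and the toy gene orders are hypothetical and do not come from the dataset.

```python
from collections import defaultdict


def synteny_graph(plasmids):
    """Map each pair of adjacent genes to the set of plasmids showing it.

    plasmids: dict of plasmid name -> ordered list of gene names on a
    circular molecule.
    """
    support = defaultdict(set)
    for name, genes in plasmids.items():
        count = len(genes)
        if count < 2:
            continue
        for i in range(count):
            # Wrap around at the end because the molecule is circular.
            a, b = genes[i], genes[(i + 1) % count]
            if a != b:
                support[frozenset((a, b))].add(name)
    return dict(support)


def variable_edges(graph, total_plasmids):
    """Adjacencies missing from at least one plasmid (alternative paths)."""
    return {edge: names for edge, names in graph.items()
            if len(names) < total_plasmids}


if __name__ == "__main__":
    # Toy gene orders, for illustration only.
    toy = {
        "pA": ["repA", "traG", "virB", "acs"],
        "pB": ["repA", "traG", "virB", "ocs", "acs"],
    }
    graph = synteny_graph(toy)
    for edge, names in sorted(variable_edges(graph, len(toy)).items(),
                              key=lambda item: sorted(item[0])):
        print(sorted(edge), sorted(names))
```

Adjacencies supported by every plasmid trace the common gene order, while adjacencies supported by only some plasmids mark presence/absence variation or rearrangements.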


Plasmid evolution 


The range in variation across types of onco- 
genic plasmids suggests that each of the types 
has been shaped by different evolutionary 
processes. We modeled plasmid histories to 
understand these processes and infer origins 
and relationships. In doing so, a constant theme 
emerged: Core genes used to characterize 
oncogenic plasmids not only have low phylo- 
genetic value but are in fact the major con- 
tributors to their variation. Their conservation 
among plasmids provides regions for recombi- 
nation and reshuffling of genes to occur. We 
therefore analyzed plasmid types at different 
scales to infer patterns of evolution. Features
of oncogenic plasmids were also accounted 
for separately because of their different evolu- 
tionary histories. 

Different levels of modularity and varia- 
tion in gene analogs contribute to oncogenic 
plasmids maintaining their function while 
adopting diverse gene composition and struc- 
ture. The vir locus is central to virulence of 
agrobacteria yet is prone to exchange among 
plasmid types. Results from k-mer and phylo- 
genetic analyses were not consistent with 
grouping on the basis of locus structure or with 
relationships among plasmid types (Fig. 4A 
and figs. S11 and S12A). The vir loci of type I.a,
type I.b, type II, type III, and type VI Ti plasmids
as well as type II and type III Ri plasmids
share a recent common ancestor and follow 
one of the paths in the k-mer graph. Those of 
type IV and type V Ti plasmids and type I Ri 
plasmids share a recent common ancestor 
and follow the second path. 

The vir locus is often acquired as separate
modules. There are two vir loci in type IV.c Ti 
plasmids. The primary vir locus is hypothe- 
sized to have been acquired from a Ri plasmid. 
By itself, this locus is likely not functional be- 
cause of multiple transposase genes interrupt- 
ing virA, which encodes a regulator of the vir 
genes (fig. S12B). The second vir locus is a remnant
that encompasses only tzs, virA, virB1,
virB2, and virB3 and has overall identity of
93% to the vir locus of type I Ti plasmids. We 
speculate that the virA allele of this fragment 
is necessary to complement the disrupted 
allele of the primary vir locus. In addition, 
virE1-2 and GALLS genes are interchangeable
submodules that are frequently inherited
separately from the rest of the vir locus. This
was observed for virE1-2 genes among type
IV-VI Ti and type II Ri plasmids (fig. S12).
Likewise, the vir locus of type III Ri plasmids,
despite having a common ancestor with that
of type I Ti plasmids, carries GALLS instead
of virE1-2.

T-DNAs are extraordinarily diverse in size 
as well as composition and are highly re- 
combinogenic (Fig. 4B, fig. S13, and data S6). 
T-DNA transfer is largely dependent on a 
short right border sequence and a flanking 
overdrive enhancer sequence (fig. S14) (23-25). 
Border sequences are practically invariant 
among 213 T-DNAs, and this strict conser- 
vation of short sequences is a low barrier for 
generating alternative T-DNA-vir combina- 
tions and multiplexing T-DNAs in plasmids 
(fig. S14, A and B, and data S6). Diversifica- 
tion can be driven by gene gain and loss from 
T-DNAs with little to no consequence to the 
transformation process. Chimerization is an- 
other frequently permitted mechanism of 
diversification (figs. S15 and S16). The orig- 
inal T-DNA of the type I.a Ti plasmid (e.g., 
pTiC58) was extended, from left to right, by 
invasion of two additional T-DNAs. The last 
of the T-DNAs is one of two prominent var- 
iants that invaded and swept through the 
type I-IV Ti plasmids, potentially because of 
a selective advantage over others. This is the 
path from acs through the 6b gene to the right
border that cuts prominently across the gene 
synteny graph. The type II Ti plasmid has two 
left borders in T-DNA-1 because a second 
T-DNA displaced all but the most left-flanking 
acs gene of the original T-DNA. T-DNA-2 of 
the type VI Ti plasmid has three right border 
sequences and two sets of homologous, but 
nonparalogous, oncogenes. This T-DNA follows 
a complex path in the graph. 

However, within this dataset, there is little 
evidence for exchange of T-DNAs between 
classes of oncogenic plasmids. With the ex- 
ception of T-DNA-2 of type III Ri plasmids, 
T-DNAs of the Ti and Ri classes have distinct 
gene compositions and segregate into different
regions of the gene synteny graph prior
to converging on a common subset of opine
synthesis genes (Fig. 4B).

Fig. 3. Variations within and between types and subtypes of oncogenic plasmids. (A) Visualization of
variation within plasmid subtypes. Circles, starting from the innermost circle, are a gene synteny graph, a
plasmid map, and bar graphs representing relative depth of coverage of sequencing reads (sliding window of
kb), number of SNPs (sliding window of 1 kb), and number of soft-clipped reads (25 clipped reads). Nodes
in the synteny graph represent genes present in ≥1 plasmid of a type. Nodes are connected if genes are
adjacent in at least one plasmid. The network cycle represents the most common order of genes within a
type. Different structures, such as gene presence/absence variation, rearrangements, or inversions present in
plasmids, are shown as alternative paths in the cycle. Nodes are colored according to key functions and
aligned to corresponding features in the plasmid map, or colored yellow for transposase- and insertion
sequence-encoding genes. Soft-clipped reads are those that must be trimmed in order to align to the
reference and can be evidence for a recombination breakpoint. Type I.a and type I.b Ti plasmids are
presented as examples. (B) General structure and organization of type II-VI Ti plasmids, as well as their
subtypes, and type I-III Ri plasmids. Simplified maps show locations and spatial relationships of key modules.
They are colored and labeled with a letter code: A, acc; O, opine genes; R, repABC; T-1 to T-4, T-DNA 1-4;
Ta, tra; Tb, trb; V, vir genes. The triangle represents an insertion sequence. Visualizations are as shown
in (A) for each of the plasmid types and subtypes (figs. S5 to S9).
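
The gene synteny graph described in the Fig. 3 legend (genes as nodes, with an edge wherever two genes are adjacent in at least one plasmid) can be illustrated with a minimal sketch. The function name, the toy gene orders, and the use of edge counts to flag the shared path are assumptions made for illustration; this is not the authors' pipeline.

```python
from collections import defaultdict

def synteny_graph(plasmids):
    """Build an undirected gene synteny graph from circular gene orders.

    plasmids: dict mapping plasmid name -> list of gene names in map order.
    Returns a dict mapping frozenset({gene_a, gene_b}) -> number of plasmids
    in which the two genes are adjacent.
    """
    edges = defaultdict(int)
    for order in plasmids.values():
        n = len(order)
        seen = set()
        for i in range(n):
            a, b = order[i], order[(i + 1) % n]  # wrap around: plasmids are circular
            if a != b:
                seen.add(frozenset((a, b)))
        for edge in seen:  # count each adjacency once per plasmid
            edges[edge] += 1
    return dict(edges)

# Hypothetical toy data: the gene orders are placeholders, not real plasmid maps.
toy = {
    "p1": ["repA", "virA", "acs", "tms1", "tms2"],
    "p2": ["repA", "virA", "tms1", "tms2"],  # acs absent, giving an alternative path
}
graph = synteny_graph(toy)
# Edges present in every plasmid approximate the shared cycle of the type.
common = {edge for edge, count in graph.items() if count == len(toy)}
```

In this toy case the absence of acs from the second plasmid adds a virA-tms1 edge, which is the kind of alternative path the legend describes for presence/absence variation.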

The structural arrangement of modules 
affects the degree of flexibility of oncogenic 
plasmid types. Most T-DNAs have a gene for 
synthesizing an opine; opines are conjugates 
between a keto acid and an amino acid (K-A), 
a sugar and an amino acid (S-A), or two sugars 
(S-S). All oncogenic plasmids have at least one 
T-DNA with an opine synthesis gene, and plas- 
mid classification traditionally relied on the 
opine synthesized, despite assumptions of fre- 
quent exchange of opine genes (Fig. 4). Al- 
though variation of opine genes does exist, 
there is no evidence for the generalization of 
rampant exchange among all types of plas- 
mids (Fig. 4C). Opine variation is influenced 
by the proximity of genes to the right borders
of T-DNAs. When the cognate opine anabo- 
lism and catabolism/uptake genes are closely 
linked and separated by only the right border, 
swapping of opine loci and the border se- 
quence can occur with little constraint (figs. 
S5, S6, S14, S17, and S18). Type I.b Ti plasmids 
are very promiscuous (fig. S5). We hypothesize
that swapping is mediated by a process
that involves nonhomologous recombination 
within oncogenes. In type III Ti plasmids, 
exchange is predicted to be mediated by ho- 
mologous recombination among oncogenes 
(fig. S6). Because K-A and S-A genes are as- 
sociated with different sets of oncogenes, 
swapping is most frequently limited to with- 
in their structural groups of opines (fig. S17 
and data S6). Type I.a Ti plasmids are structurally
similar to type I.b Ti plasmids and
have the potential to diversify opines but are 
far more limited. We suggest that the low 
diversity is a consequence of the ancestral 
type I.a Ti plasmid experiencing a bottleneck 
along with BV2 (Fig. 1). 

In other types of plasmids, important func- 
tional modules separate opine genes, and 
multiple recombination events would need 
to occur to acquire the two interdependent 
modules. For example, opine genes of type II
plasmids are interrupted by tra genes (fig. S6). 
In type IV.a Ti plasmids, genes necessary for 
anabolism and catabolism of nopaline are 
separated by those involved in the catabolism
of a second opine, agrocinopine (fig. S7).
The acs gene, cognate to acc and necessary 
for synthesizing agrocinopine, is within and 
adjacent to the left border of the T-DNA (Fig. 
4B and fig. S13). This organization is restrictive; 
for opine swapping to occur, either a larger 
fragment that included most of the T-DNA 
would also need to be exchanged, or the two 
modules for nopaline would need to be ac- 
quired separately. 

The trb locus is associated with a high number
of single-nucleotide polymorphisms (SNPs)
and has diverse alleles (figs. S3 to S9 and S19). 
These data are consistent with the possibil- 
ity of recombination within the trb locus, and 
because trb is adjacent to opine loci, it likely 
contributes to the swapping of opine cassettes 
in many types of Ti plasmids. Some type III Ti 
plasmids have additional copies of trb genes 
located distal to the main trb locus, likely due
to duplication resulting from recombination 
within the locus (fig. S18B). In contrast, in 
the Ri class, trb is paired with tra and is sep- 
arated from opine loci by large regions with 
no genes involved in virulence or plasmid 
maintenance (fig. S9). This difference in spatial 
relationship between classes likely constrains 
recombination between Ti and Ri plasmids. 
However, tra and trb could mediate recombi- 
nation with homologous loci of the diverse 
non-oncogenic plasmids that are prevalent in 
the ARC (fig. S20). 

All findings were synthesized to model the 
evolution of oncogenic plasmids (Fig. 5). Ti 
and Ri plasmids are distantly related and are 
hypothesized to have emerged from an ancestral
proto-oncogenic plasmid (13, 26). We hypothesize
that this ancient replicon carried
vir, tzs, acs, and an opine synthesis gene
flanked by T-DNA borders (fig. S21 and data
S8). The tzs gene is a paralog of ipt (tmr) that
is distal to the T-DNA and not transferred 
into plant cells (27). The tzs and ipt genes en- 
code isopentenyltransferases, enzymes that 
catalyze the first step in the synthesis of the 
plant growth-promoting hormone cytokinin 
(28). However, cytokinins derived from Tzs 
have been implicated in regulating vir gene 
expression and promoting virulence (29, 30).
The proto-oncogenic plasmid was suggested 
to have originated as a catabolic plasmid. The 
largest set of genes inferred to have a shared 
evolutionary history includes 52 genes (36% of 
those analyzed), of which 10 are implicated in 
the uptake of nutrients or metabolism and 
another 18 have unknown functions (data S8 
and figs. S22 and S23). This set also includes 
repABC. We propose that the proto-oncogenic
plasmid led to either the Ti or Ri class of plas- 
mid and recombined with a separate repABC 
plasmid to yield the other class. 

The next innovation was the acquisition of 
oncogenes within T-DNA border sequences. 
Every one of the sequenced Ti plasmids has 
at least one T-DNA that circumscribes tms1
and tms2 (aux2/iaaH), and all Ri plasmids
have at least one T-DNA with tms1 (Fig. 4B
and data S6). The tms2 gene cooperates with
tms1 in auxin biosynthesis (12). Data are consistent
with the ancestors of Ti and Ri plasmids
independently acquiring tms1 from different
nonagrobacterial sources (fig. S24). Moreover,
even though tms1 and tms2 are a functional
module and are consistently linked in T-DNAs
of Ti plasmids, tms2 was potentially acquired
separately from tms1 by an ancestral Ti plasmid
(fig. S24C). The evidence suggests that in
Ti plasmids, ipt was derived from a duplication
of tzs and the paralog was incorporated
within the T-DNA region (fig. S21).

An unexpectedly small number of events are 
sufficient to relate plasmid lineages (Fig. 5 and 
figs. S3, S4, and S12 to S24). Single-copy genes
of oncogenic plasmids were grouped into just 
18 sets inferred to have similar evolutionary 
histories (fig. S23). Five sets were sufficient 
to represent 80% of the genes analyzed. In 
contrast, tra and trb genes are distributed 
across 50% of the sets. Type IV.c Ti plasmids 
are cointegrates of a type I Ti plasmid and a 
Ri plasmid. The two sequenced type IV.c Ti 
plasmids have edges to every type I.a node, 
are nearly twice the size of pTiC58, and have 
a second tra locus located between two vir 
loci (Fig. 1A and fig. S10). In a modification 
to a previously proposed model, we suggest 
that all subtypes of the type IV plasmids are 
derived from the same lineage of the co- 
integrated ancestor and that type IV.a and 
type IV.b Ti plasmids are streamlined variants 
(15). The type VI Ti plasmid is also likely a 
cointegrate. This plasmid is novel in that it 
horizontally acquired, via T-DNA invasion, a 
second set of tmsI and tms2, which is unlike 
the first set found in all types of Ti plasmids 
(fig. S24). Type II Ti plasmids are hypothesized 
to have recently emerged from a rearrange- 
ment event within the type III Ti plasmids. 
Type V Ti plasmids are more closely related to 
Ri plasmids and are the most genetically and 
structurally distinct of the Ti plasmids. The 
multiple changes and mosaic structure make 
its evolutionary history difficult to interpret. 
Its unique structure is predicted to limit op- 
portunities for productive recombination events 
with other oncogenic plasmids. Although Ti 
and Ri classes are very distantly related, they 
can co-reside in cells and exchange regions. 
Strain Di1411, for example, carries a type I.a
Ti plasmid and a type II Ri plasmid. 


Modeling disease spread 


We next used this evolutionary framework 
to identify epidemiological patterns among 
strains collected across the world and span- 
ning nearly a century of collection. The frame- 
work allowed us to analyze strains and plasmids 
as independently transmitted entities to ac- 
curately model disease spread. An additional 
66 genome sequences of hierarchically sam- 
pled strains were used to guide grouping of the 
agrobacterial strains and oncogenic plasmids 
to ranks below species and subtypes, respectively
(data S10). Strains of G1, G4, G7 (three
species-level groups), and G8 (three species- 
level groups) of BV1 as well as BV2 were di- 
vided into 124 unique genotypes. Type I-III 
Ti plasmids were divided into 40 distinct plas- 
mid clusters (data S11 to S19). 

Multiple patterns underscore aspects of ag- 
ricultural ecosystems that promote the diver- 
sification and spread of pathogens (Fig. 6 and 
fig. S25). Nonpathogenic, plasmid-lacking strains 
were isolated from healthy and diseased in- 
dividuals (data S12, S15, S16, and S17). For 
example, three nonpathogenic strains were 
cultured from symptomatic plants at facility 
C7_N18, a location that also had pathogenic 
strains, differing by ~130,000 to ~290,000 SNPs 
(Fig. 6A and data S17). As potential recipients 
of virulence plasmids, nonpathogenic strains, 
which are common in soils and in association 
with plants, represent a standing pool of ge- 
netic diversity for the evolution of new path- 
ogen genotypes. Strain-plasmid combinations 
can persist over the long term within agricul- 
tural ecosystems. Strains LMG 267 (type II 
Ti) and B140/95 (type II Ti) were collected 
60 years apart yet are members of the same 
genotype and plasmid cluster, with 7 and 0 SNP 
differences, respectively (Fig. 6B and fig. S25B). 
Pathogens can be transmitted between agri- 
cultural and unmanaged ecosystems. Three 
members of a BV2 genotype (type I.a Ti) linked
a natural ecosystem, an agricultural ecosystem,
and an undocumented location in a different 
country (Fig. 6C and fig. S25C). The three 
strains and their Ti plasmids have <5 and 
0 SNP differences, respectively, among them. 
S6_N30 and S6_N25 are almost 240 km apart, 
but two other plant production facilities are 
located within 1.5 km of S6_N30 and could 
have bridged the link to S6_N25. 

Agricultural ecosystems can have recur- 
rent infections. Locations in which different 
genotype-plasmid combinations were detected 
had likely experienced independent infections. 
Facility S2_N7 had eight strains representing 
seven different genotype-plasmid combina- 
tions (Fig. 6D and fig. S25, I to M). Over a 12-year 
span, S8_N32 was infected by G1 (type III Ti)
strains that have >50,000 and >350 SNP differ- 
ences among strains and plasmids, respectively 
(fig. S25, N to P). An agricultural ecosystem 
can also have disease reservoirs; this idea is 
supported by the presence of multiple strains 
of the same genotype-plasmid combination on 
different individuals. Location C7_N18 had 
27 strains of the same G7 genotype (type III 
Ti) on four individuals of three different host 
species (Fig. 6A and fig. S25Q). 

There were at least seven cases in which 
global distribution of plants is hypothesized to 
have contributed to the transmission of a strain- 
plasmid combination. These are patterns in 
which genotype and plasmid nodes are connected
by multiple edges. Patterns by themselves
are insufficient for differentiating between di- 
rect and indirect transmission routes. How- 
ever, one case is highlighted that includes a 
facility that produces plants for wholesalers 
and could be a common source. Strains belong- 
ing to a G1 genotype (type II Ti) were identi- 
fied from facility C4_N13 (Fig. 6E and fig. S25E). 
Strains of the same genotype-plasmid combina- 
tion were later identified in two other facilities. 
The strains and plasmids from these three locations
have <10 and 0 SNP differences, respectively,
among them. The dataset has other
instances in which sets of closely related strains 
marginally exceeding the >15 SNP difference 
threshold have plasmids that belong to the 
same cluster. It is thus possible that there 
were more strain-plasmid transmissions than 
reported. 

Horizontal transmission of plasmids greatly 
diversifies and amplifies the spread of patho- 
gens. Fifteen networks have a plasmid cluster 
as a central hub connected to multiple genotype
nodes (fig. S25). We use the simple case of
the type I.a Ti plasmid of strain Di1411 as an
example (Fig. 6F and fig. S25F). This plasmid
has only two SNP differences relative to the
type I.a Ti plasmid of another BV2 strain. In
contrast, the two strains linked by the type I.a 
Ti plasmid cluster have more than 22,000 SNP 
differences in their chromosomes. The most 
pronounced example has 28 BV2 strains 
and one G1 strain collected from more than 
20 locations that have type I.a Ti plasmids 
with no more than 13 SNP differences among 
them (fig. S25R and data S11). The clustering of 
these 29 plasmids is not a consequence of low 
SNP diversity. The total of 49 type I.a Ti plas- 
mids in this study formed 10 unique clusters 
that vary by nearly 5000 SNP differences. The 
diversity of BV2 strains is also not a conse- 
quence of the conservative threshold used to 
define genotypes. Even at 1000 SNP differences, 
the BV2 strains would separate into 17 differ- 
ent genotypes. 
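
The effect of the SNP threshold on the number of genotypes can be illustrated with a small sketch. It assumes single-linkage grouping on pairwise SNP counts, which is an assumption for illustration and not necessarily the procedure used in the study; the strain names and distances are hypothetical, and only the 15-SNP and 1000-SNP thresholds come from the text.

```python
def group_by_snp_threshold(names, distance, threshold):
    """Single-linkage grouping: two strains share a group if a chain of
    pairwise SNP distances <= threshold connects them."""
    parent = {n: n for n in names}

    def find(x):
        # Follow parent links to the group representative (with path halving).
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if distance[frozenset((a, b))] <= threshold:
                parent[find(a)] = find(b)  # merge the two groups

    groups = {}
    for n in names:
        groups.setdefault(find(n), []).append(n)
    return sorted(groups.values())

# Hypothetical pairwise SNP counts between four made-up strains.
strains = ["s1", "s2", "s3", "s4"]
snps = {
    frozenset(("s1", "s2")): 7,
    frozenset(("s1", "s3")): 48,
    frozenset(("s2", "s3")): 52,
    frozenset(("s1", "s4")): 22000,
    frozenset(("s2", "s4")): 22010,
    frozenset(("s3", "s4")): 21990,
}
strict = group_by_snp_threshold(strains, snps, 15)    # conservative threshold
loose = group_by_snp_threshold(strains, snps, 1000)   # relaxed threshold
```

With these toy distances the conservative threshold yields three groups and the relaxed one yields two, while the strain that differs by about 22,000 SNPs stays separate under both.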

Management practices in agricultural ecosystems
have been suggested to increase opportunities
for plasmid conjugation (31). Supporting
evidence is found in three networks in which 
the same location is mapped to a plasmid node 
and multiple associated genotype nodes. Strains 
of a G1 and G7 genotype carrying type III Ti
plasmids with 0 SNP differences were identi- 
fied in facility S9_N46 (Fig. 6G and fig. S25G). 
In the second example, one strain of a BV2 
genotype and three strains of a G1 genotype
were collected from facility S2_N7 (fig. S25K). 
Their type II Ti plasmids have 0 SNP differ- 
ences among them. Last, a strain of a G4 geno- 
type and two strains of closely related BV2 
genotypes have type I.a Ti plasmids with <1 SNP 
difference (fig. S25S).
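
The network pattern used as evidence here, one plasmid cluster linked to several chromosomal genotypes with some of those links sharing a location, can be sketched as follows. The record layout, function name, and all labels are hypothetical and serve only to show the logic; they are not drawn from the dataset.

```python
from collections import defaultdict

def plasmid_hubs(records):
    """Return plasmid clusters found in more than one chromosomal genotype.

    records: iterable of (genotype, plasmid_cluster, location) tuples.
    The result maps plasmid_cluster -> {"genotypes": set, "shared_locations": set},
    where shared_locations are sites at which >= 2 genotypes carry the cluster.
    """
    genotypes = defaultdict(set)
    by_site = defaultdict(lambda: defaultdict(set))
    for genotype, cluster, location in records:
        genotypes[cluster].add(genotype)
        by_site[cluster][location].add(genotype)

    hubs = {}
    for cluster, found in genotypes.items():
        if len(found) > 1:  # same plasmid cluster in different chromosomal backgrounds
            shared = {loc for loc, g in by_site[cluster].items() if len(g) > 1}
            hubs[cluster] = {"genotypes": found, "shared_locations": shared}
    return hubs

# Hypothetical records; labels are illustrative only.
records = [
    ("G1_a", "pTi_cluster_1", "site_X"),
    ("G7_a", "pTi_cluster_1", "site_X"),
    ("BV2_a", "pTi_cluster_2", "site_Y"),
    ("BV2_a", "pTi_cluster_2", "site_Z"),
    ("G4_a", "pTi_cluster_3", "site_Y"),
    ("BV2_b", "pTi_cluster_3", "site_Z"),
]
hubs = plasmid_hubs(records)
```

A cluster with a non-empty set of shared locations corresponds to the case described above, in which the same site maps to one plasmid node and several genotype nodes.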

Plasmid dissemination can also lead to the 
temporal spread and persistence of disease. 
The type I.a Ti plasmid of strain K27, collected 
prior to 1964, has <2 SNP differences relative 
to seven other type I.a Ti plasmids present in
strains collected during the period 1995-2009 
(Fig. 6H and fig. S25H). As previously stated, 
this is likely not a reflection of low SNP di- 
versity within the type I.a Ti plasmids. 

We found one instance in which two closely 
related strains have distantly related Ti plas- 
mids, suggesting independent acquisition 
events. Strain Z4/95 and strain K27 differ by 
only 48 SNPs (fig. S25, L and T, and data S19). 
The type I.a Ti plasmid of Z4/95 has ~2400 SNP 
differences in comparison to the type I.a plas- 
mid of K27 and does not cluster with any other 
type I.a Ti plasmid (data S11). 


Discussion 


Modularity confers phenotypic robustness 
and allows oncogenic plasmids to diversify. 
But diversification is opposed by selective 
forces that preserve genetic and physical links
necessary to maintain functionality of the
plasmids. By accounting for variation and
constraints, we were able to sort oncogenic
plasmids into defined lineages and infer their
evolution. We revealed the most commonly
observed types, which are foundational for
understanding types yet to be discovered.

Weisberg et al., Science 368, eaba5256 (2020)

Diversification of pathogen populations by 
plasmid transmission is promoted by the ag- 
ricultural industry, where large populations 
of susceptible host species are intensively 
managed and often clonally propagated with- 
in the same facilities, individuals are moved 
across the world, and disease can persist un- 
detected for extended periods of time. Although 
clinical settings have some similar features that 
promote diversification, antibiotic use is sug- 
gested to result in a recurring pattern of clonal 
expansion of pathogens followed by global 
spread of “super-fit” bacterial lineages (32). 
However, analysis of plasmids has shown that 
recombination of antibiotic resistance genes 
can occur, and entire plasmid sequences need 
to be studied to avoid drawing misleading 
conclusions (33). In addition, transmission of 
plasmids can occur within and even between 
genera, enhancing and promoting new epi- 
demics (34, 35). Our strategy relied on multiple 
methods to determine plasmid relationships 
and coupled it with ancestry of chromosomes 
to unravel the mosaicism of bacterial evolu- 
tion. This approach has applications to other 
systems in modeling the impact of mobile ge- 
netic elements on bacterial evolution to limit 
risks from infectious diseases. 

STRUCTURAL BIOLOGY


The structure of human CST reveals a decameric 
assembly bound to telomeric DNA 


Ci Ji Lim, Alexandra T. Barbour, Arthur J. Zaug, Karen J. Goodrich, Allison E. McKay,
Deborah S. Wuttke, Thomas R. Cech*


The CTC1-STN1-TEN1 (CST) complex is essential for telomere maintenance and resolution of stalled replication 
forks genome-wide. Here, we report the 3.0-angstrom cryo-electron microscopy structure of human CST 
bound to telomeric single-stranded DNA (ssDNA), which assembles as a decameric supercomplex. The 
atomic model of the 134-kilodalton CTC1 subunit, built almost entirely de novo, reveals the overall architecture 
of CST and the DNA-binding anchor site. The carboxyl-terminal domain of STN1 interacts with CTC1 at two 
separate docking sites, allowing allosteric mediation of CST decamer assembly. Furthermore, ssDNA appears to 
staple two monomers to nucleate decamer assembly. CTC1 has stronger structural similarity to Replication 
Protein A than the expected similarity to yeast Cdc13. The decameric structure suggests that CST can
organize ssDNA analogously to the nucleosome’s organization of double-stranded DNA. 


CTC1-STN1-TEN1 (CST) is a protein complex
essential for telomere replication
(1-4) and as a DNA polymerase alpha-primase
(Pol-α) cofactor (5), and it functions
genome-wide to recover stalled
replication forks (2, 6-9) and facilitate DNA 
damage repair (10-13). Consequently, muta- 
tions in CST are the basis of human genetic 
diseases such as Coats plus syndrome and 
dyskeratosis congenita (14-19). 

Although it preferentially binds short telo- 
meric single-stranded DNA (ssDNA) (20-23), 
CST can also bind less specifically to longer 
ssDNA (2, 4). An intact heterotrimeric CST 
complex is necessary for its DNA-binding 
function (4, 15, 24), but limited understand- 
ing of mammalian CST architecture has ham- 
pered the determination of its DNA-binding 
region(s). Structures of human components 
are limited to STN1 and TEN1 (24), and solving
the structure of the largest subunit CTC1 has 
been technically challenging, with only a sin- 
gle domain being determined (25). The yeast 
Cdc13 protein associates with Stn1 and Ten1
and has therefore been proposed as a CTC1 
homolog, despite Cdc13 and mammalian CTC1 
being unrelated in sequence. Hence, it has 
been unclear if Cdc13 and CTC1 share struc- 
tural homology. 


Cryo-electron microscopy (cryo-EM) structure 
of human CST decameric supercomplex 


We solved the structure of purified recombi- 
nant human CST protein (hereafter termed 
“DNA-free CST”) to 6.3-Å resolution using
single-particle cryo-EM (fig. S1). To improve
the resolution of the structure, we added a minimal
telomeric ssDNA [3xTEL, (TTAGGG)3] (4)
to the purified CST protein and unexpectedly 
discovered a symmetric complex that was con- 
siderably larger than the monomeric CST (Fig. 
1A and fig. S2, A and B). Subsequent cryo-EM
processing revealed that the symmetric com- 
plex was a decameric supercomplex (10 CST 
monomers) with D5 symmetry, which was reconstructed
at 3.0-Å global resolution (Fig. 1A
and fig. S2C). The CST monomers were com- 
putationally extracted from the supercom- 
plex and further sorted to obtain a final set 
of data, which led to the cryo-EM map of a 
CST monomer still at 3.0-Å global resolution
(Fig. 1B and figs. S2D and S3 to S5) but with the 
map quality substantially improved (fig. S5A). 
This enabled us to dock all the available crystal 
structures of the domains of human TEN1, 
STN1 (24), and a central OB (oligonucleotide-oligosaccharide-binding
fold) domain of CTC1
(25) with high confidence (Fig. 1B). Further- 
more, we were able to build de novo the re- 
maining unsolved body of CTC1 [894 residues, 
excluding the one CTC1 OB domain previously 
solved (25)] (fig. S5B). 
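
The count of ten monomers follows directly from D5 symmetry: the dihedral point group D5 contains five rotations about a fivefold axis and five twofold rotations about perpendicular axes. The sketch below generates these ten operations and applies them to a single point. The choice of axes and the monomer coordinate are arbitrary assumptions for illustration and do not come from the deposited model.

```python
import math

def rot_z(theta):
    """Rotation by theta (radians) about the z axis."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def rot_x_180():
    """Twofold rotation about the x axis, perpendicular to the fivefold z axis."""
    return [[1.0, 0.0, 0.0], [0.0, -1.0, 0.0], [0.0, 0.0, -1.0]]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def apply(m, v):
    return tuple(sum(m[i][k] * v[k] for k in range(3)) for i in range(3))

def d5_operations():
    """Return the ten proper rotations of the dihedral point group D5."""
    fivefold = [rot_z(2 * math.pi * k / 5) for k in range(5)]
    twofold = [matmul(r, rot_x_180()) for r in fivefold]
    return fivefold + twofold

# Hypothetical monomer centre, placed off every symmetry axis.
monomer = (10.0, 3.0, 5.0)
copies = [apply(op, monomer) for op in d5_operations()]
distinct = {tuple(round(c, 6) for c in p) for p in copies}
```

The ten copies fall into two rings of five, one above and one below the plane perpendicular to the fivefold axis.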

Human chromosome ssDNA telomeric over- 
hangs are 50 to 200 nucleotides (nt) long, much 
larger than 3xTEL, so we tested CST binding to 
15xTEL (90 nt). CST bound 15xTEL with sixfold 
higher affinity than 3xTEL (fig. S6, A to C, and 
table S1), and decameric CST supercomplexes 
were readily apparent by negative-stain EM (fig. 
S6D). Thus, the decamer can form with both 
long and short telomeric ssDNA molecules. 
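
As a quick check of the oligonucleotide lengths quoted above, the substrates can be written out from the TTAGGG repeat. The helper names are hypothetical, and the listing of TAGG positions is only an illustration of where such a tetranucleotide occurs in 3xTEL; the text does not state which repeat the visible nucleotides belong to.

```python
REPEAT = "TTAGGG"  # human telomeric repeat, as given in the text

def tel(n_repeats):
    """Return an n-repeat telomeric ssDNA sequence, e.g. tel(3) for 3xTEL."""
    return REPEAT * n_repeats

def find_sites(seq, motif):
    """Return 0-based start positions of every (possibly overlapping) motif hit."""
    return [i for i in range(len(seq) - len(motif) + 1)
            if seq[i:i + len(motif)] == motif]

tel3 = tel(3)     # 3xTEL
tel15 = tel(15)   # 15xTEL
tagg_sites = find_sites(tel3, "TAGG")
```

Here 3xTEL is 18 nt and 15xTEL is 90 nt, and each repeat contributes one TAGG site.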


Overall architecture of human CST 


Model building revealed the overall architec- 
ture of the human CST heterotrimer (Fig. 1, 
C and D; fig. S5B; and table S2). CTC1 is composed
of seven tandem OB domains (OB-A
through G; Fig. 1, C and D). The human CST 
complex has a subunit stoichiometry of 1:1:1, 
unlike the nonuniform stoichiometry reported 
for the Candida glabrata CST complex (26). 

The C terminus of CTC1 (OB-D through G)
serves as a hub for STN1 and TEN1 assembly
(Fig. 1D). A single STN1 protein has two separate
interaction sites with CTC1, with the
STN1 N-terminal half (STN1n) interacting with
CTC1 OB-G and the C-terminal half (STN1c)
with CTC1 OB-E (Fig. 2A). These two halves of
STN1 are connected by an unstructured peptide
linker of seven residues (Fig. 2A). In contrast to
the related ssDNA-binding protein, Replication
Protein A (RPA) (27, 28), there is no triple-helix
bundle stabilizing the heterotrimeric CST complex
(Fig. 2B). Instead, TEN1 binding to CTC1
is bridged by STN1n [similar to the model of
the Tetrahymena CST (29, 30)] (Fig. 2B), with
STN1n binding to a highly conserved interaction
patch on CTC1 OB-G (fig. S7A).

The first winged helix-turn-helix (wHTH)
domain of STN1c interacts with CTC1 OB-E
(Fig. 1D). However, no strong conservation
of residues occurs on the interaction patch
of CTC1 OB-E (fig. S7A), suggesting that STN1c-CTC1
interaction could be weaker than STN1n-CTC1
interaction, as reported for the Tetrahymena
CST complex (30). Supporting this hypothesis,
we found that STN1n alone was able to interact
with CTC1, but STN1c could not (fig. S7,
B and C). In addition, TEN1 interaction with
CTC1 was maintained with STN1n but lost
when only STN1c was present. STN1n and CTC1
interact through two regions: the CTC1 “cleft region”
(the conserved patch on CTC1; Fig. 2B
and fig. S7A) and a new CTC1-STN1n three-helix
bundle (Fig. 2C). The importance of the
cleft and the three-helix bundle for CTC1-STN1
association was confirmed by mutagenesis
(fig. S7, D and E).

CTC1 OB folds E, F, and G are arranged spa- 
tially on OB-D, which acts like a scaffold, re- 
sulting in these four OBs forming a ringlike 
structure (Fig. 1D). Structural homology anal- 
ysis of individual CTC1 OB domains found 
CTC1 to be most similar to RPA and Teb1 (an 
RPA-like paralog in Tetrahymena) (fig. S8), 
with CTC1 OB-F most similar to Teb1's OB-B
(31) and OB-G similar to OB-C of RPA70 or 
Teb1 (27). CTC1 OB-G also has a conserved zinc 
ribbon motif like that of the OB-C domains of 
RPA and Teb1 (27, 31) (fig. S9). The scaffolding
OB-D has no convincing structural homolo- 
gies, but given its distinctive extended OB-fold 
structure (Fig. 1D), it could be an evolved form 
of the more compact and conventional OB- 
fold (32). Notably, despite the long-standing 
suggestion that yeast Cdc13 and mammalian
CTC1 are homologs (33), we found weaker struc- 
tural homologies to Cdc13 than with the best 
RPA70 homology matches (based on DALI 
structural homology Z-score, fig. S10). 
Fig. 1. Cryo-EM structure of human CST decameric supercomplex and its
architecture. (A) Cryo-EM density of decameric CST complex colored by
segmented CST monomers. (B) Docking of available atomic models of a CTC1
OB-domain [*reported as central domain OB-fold (25), PDB 5W2L], STN1n (N-terminal
half, PDB 4JOI:A), STN1c (C-terminal half, PDB 4JQF), and TEN1 (PDB
4JOI:C). (C) Structure-based schematic of CST domain architecture and
intermolecular interactions between subunits. The individual OB domains of
CTC1 are rainbow colored. (D) CTC1 architectural organization of seven OB
domains (A to G) and the identified bound-ssDNA (space-filled model). Spokes in
(A) are STN1c, whereas the legs are CTC1 OB-A, -B, and -C.

Fig. 2. CST intersubunit interactions and CTC1 molecular motifs. (A) STN1 N-terminal and C-terminal
halves (STN1n and STN1c) interact separately with CTC1 by means of a flexible peptide linker. (B) STN1 and
TEN1 do not interact with CTC1 using a trimeric helix-bundle like human RPA; instead, STN1 directly interacts
with a highly conserved patch on CTC1 (Cleft) and bridges TEN1 to CTC1. (C) CTC1-STN1n three-helix bundle that
is involved in CTC1 and STN1 assembly. α1 (highlighted dark gray) is from STN1n and α2 and α3 (highlighted
bright pink) are from CTC1 OB-G. (D) The hinge three-helix bundle (annotated α1, α2, and α3) connects OB-A,
-B, and -C to the rest of the C-terminal OB-domains of CTC1.


We also found an intramolecular three-helix 
bundle bridging OB-C and OB-D (Fig. 2D). This 
three-helix bundle (termed hinge three-helix bun- 
dle) effectively segregates OB-A, -B, and -C from 
the C-terminal OB domains (Fig. 1, C and D). 
CST OB-C serves as a scaffold for both OB-A 
and OB-B (Fig. 1D). Because of extensive flexi- 
bility of OB-A, we could only de novo build a 
poly(alanine) model (~45 residues) for it, with 
the overall backbone of OB-A clearly showing 
the OB-fold topology. Structural homology 
searches of OB-B and OB-C reveal them to be 
most similar to Ustilago maydis RPA70 OB-A 
and OB-B (28) (fig. S8). The multiple structural 
homologies of human CTC1 to various domains
of RPA70 suggest that CTC1 may have evolved 
from RPA (figs. S8 and S10). 

Disease mutations in CTC1 that have been
shown to interfere with Pol-α binding (15) are
located on CTC1 OB-B (A227 and V259) and on
scaffold OB-D (V665) (fig. S11). Given that Pol-α
has a bilobal architecture (34), the catalytic
and primase lobes of Pol-α could engage CST
at separate sites.


CST ssDNA-binding anchor site 


Four nucleotides, TAGG, were clearly visible in 
the cryo-EM map of the complex and not in 
the DNA-free CST cryo-EM map (Fig. 3, A to
C; see table S3 for cross-correlation analysis,
fig. S12A), suggesting that these interact with
the protein most strongly or with highest occupancy.
The rest of the ssDNA is likely flexible
or bound to CST in multiple binding modes,
which is consistent with CST being able to
bind multiple configurations of ssDNA dynamically
(4, 7, 23). Hereafter, we identify the site
of ssDNA binding on CTC1 as the ssDNA anchor
patch.

Fig. 3. Telomeric ssDNA-binding anchor site of CST.
(A) A 4-nt segment of the single-stranded telomeric DNA is
located on CTC1. (B) Coulombic surface analysis (36) reveals
that the ssDNA anchor site is highly positively charged (blue;
red is negatively charged surface). (C) Cryo-EM density of
the ssDNA molecule built with the sequence assigned as TAGG
(5'-T1-A2-G3-G4-3'). The numbering is based on the visible
ssDNA, not the full-length ssDNA. (D) CTC1 residues involved
in ssDNA binding are shown in yellow and cyan from OB-F
(yellow on purple) and OB-G (cyan on pink), respectively.
(E) Schematic of CST ssDNA-binding anchor site across CTC1
OB-F and OB-G. (F) Gel-shift assay showing that CST DNA-binding
mutants predicted from the atomic model no longer
bind telomeric ssDNA (TTAGGG)3. Wedges indicate twofold
dilutions of CST starting at 50 nM, with the fifth lane of each
group having no protein added. K743E/R744E mutant does not
directly bind DNA and was used as a control to test if charge
swaps in the vicinity might be sufficient to destabilize DNA
binding. (G) CST DNA-binding mutants can still form heterotrimeric
CST complex as shown by tandem immunoprecipitation
pull-down assays [FLAG/hemagglutinin (HA)] from
exogenously expressed FLAG-CTC1, MYC-STN1, and HA-TEN1.
Asterisk (*) indicates protein degradation product. Wedges
indicate a twofold dilution that is used to ensure that Western-blot
band intensities are in the linear detection range.
(H) Human CTC1 disease mutations (15) that abolish ssDNA
binding (lime green residues) are located near the ssDNA
anchor site. Abbreviations for the amino acid residues are as
follows: A, Ala; C, Cys; D, Asp; E, Glu; G, Gly; H, His; K, Lys;
L, Leu; N, Asn; R, Arg; S, Ser; V, Val; W, Trp; and Y, Tyr.

Several positively charged residues of CTC1 
are involved in the interaction with ssDNA at 
the anchor patch (CTC1 R978, K1164, and K1167), 
as well as additional aromatic and neutral-polar 
residues (CTC1 Y949, N981, and Y983) (Fig. 3D 
and E). This anchor patch uses several kinds 
of interaction between CTC1 and ssDNA: for 
example, R978 and N981 hydrogen bonding to 
the negatively charged ssDNA phosphate back- 
bone (fig. S12B); K1164 and K1167 hydrogen 
bonding to ssDNA bases (fig. S12C); and Y949 
π-π stacking with the A2 base, which in turn is
stacked on the T1 base (fig. S12D). The tyrosine-base-base
stack is reminiscent of the stacking
arrangements seen in several OB fold-nucleic
acid interactions, e.g., the human POT1-ssDNA
structure (20). The nonspecific interactions 
mostly involve CTC1 OB-F, whereas specific 
interactions are in OB-G (Fig. 3E); the modeled 
4-nt ssDNA spans these two OB domains, sug- 
gesting that ssDNA binding stabilizes CTC1 
architecture (Fig. 3D). In addition, these ssDNA-interacting
residues are highly conserved across
mammalian CTC1 homologs (fig. S13).

To validate the observed protein-DNA inter- 
actions, we performed mutagenesis on sets 
of CTC1 residues in the ssDNA anchor patch— 
R987E/N981D/Y983A (anchor site on OB-F), 
K1164E/K1167E (anchor site on OB-G), V967A/ 
S979A/H980A (structural integrity residues on 
OB-F), and R1193E/R1195E (structural integrity 
residues on OB-G) (see fig. S14 for structural 
mapping of additional tested mutants). Each 
of these sets of mutations abolished CST DNA- 
binding activity, whereas the K743E/R744E 
negative control mutation did not (Fig. 3F). In 
all of these mutants, CST still forms a hetero- 
trimer complex (Fig. 3G). Previously identified 
CTC1 disease mutations (R975G, C985A and 
R987W) that have been shown (15) to affect
CST-ssDNA binding are also in the vicinity of 
the ssDNA anchor patch (Fig. 3H). 
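
The mutant sets above follow the usual one-letter notation (wild-type residue, position, replacement), and several of them are charge swaps. As a reading aid only, the short sketch below parses such labels and flags charge swaps. The function names are hypothetical, and the classification assumes that only K and R count as positive and only D and E as negative (histidine is treated as neutral), which is a simplification rather than anything stated in the text.

```python
import re

# Assumption: residues treated as charged at neutral pH (histidine ignored).
POSITIVE = {"K", "R"}
NEGATIVE = {"D", "E"}

MUTATION_RE = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_mutation(label):
    """Split a label such as 'K1164E' into (wild type, position, mutant)."""
    match = MUTATION_RE.match(label.strip())
    if match is None:
        raise ValueError(f"unrecognized mutation label: {label!r}")
    wild_type, position, mutant = match.groups()
    return wild_type, int(position), mutant


def is_charge_swap(label):
    """True when a positive residue becomes negative, or vice versa."""
    wild_type, _, mutant = parse_mutation(label)
    return (wild_type in POSITIVE and mutant in NEGATIVE) or (
        wild_type in NEGATIVE and mutant in POSITIVE
    )


def classify_set(mutant_set):
    """Map each mutation in a '/'-separated set to its charge-swap status."""
    return {label: is_charge_swap(label) for label in mutant_set.split("/")}


if __name__ == "__main__":
    for mutant_set in ("K1164E/K1167E", "V967A/S979A/H980A", "K743E/R744E"):
        print(mutant_set, classify_set(mutant_set))
```

Under these assumptions, the anchor-site set on OB-G and the K743E/R744E control both consist of charge swaps, which matches the stated purpose of the control: testing whether charge swaps near the site are sufficient on their own to disrupt binding.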


Assembly mechanism and pathways 
of decameric CST supercomplex 


We identified interactions that appear to me- 
diate decameric supercomplex assembly. The 
sites can be categorized into two oligomer- 
ization classes (Fig. 4, A to C): (i) dimerization 
(dihedral dimerization) and (ii) tetramerization
(two subclasses: adjacent and diagonal).
For CST dimerization, three conserved resi- 
dues at the interface are N745, L843, and 
R1175. R1175 is particularly interesting, given 


that it is also within range (<5 Å) for interaction
with the phosphodiester groups of T1 or
A2 of the opposite dihedral dimer’s telomeric 
ssDNA (Fig. 4A), which suggests ssDNA bind- 
ing can also stabilize CST dihedral dimeriza- 
tion. Consistent with this prediction, CST with 
the CTC1 R1175E mutation showed a 26-fold 
reduction in DNA-binding ability with 3xTEL 
ssDNA but no effect when tested with a non- 
specific T18 (poly-T) ssDNA (fig. S15). 

For tetramerization, CTC1 interacts with its 
diagonally opposite neighbor’s STN1n (CTC1
E1183) (Fig. 4B) and adjacent neighbor’s TEN1 
(CTC1 H484 and R624) (Fig. 4C). The proxim- 
ity of the two ssDNA anchor patches across 
dihedral dimers (fig. S16A) suggested that a
single ssDNA molecule of three TTAGGG re- 
peats could “staple” together two monomers 
into a dihedral dimer, with the first and last 
repeat engaged by the CST monomers while 
the middle repeat served as a linker (Fig. 4D). 
Consistent with this model, we found that 
replacing individual TTAGGG modules with 
T6 reduced CST DNA-binding affinity for either
the first or last repeat but was tolerated for 
the middle repeat (Fig. 4E and fig. S16, B and 
C). As an additional test, shortening the mid- 
dle repeat sequence (using oligo-T sequence 
instead of TTAGGG) to <6 nt negatively af- 
fected CST-DNA binding (Fig. 4E and fig. S16, 
D and E), consistent with the measured molecular
distance (~20 Å, fig. S16A) between the

Fig. 4. Molecular interactions underlying CST decameric
supercomplex formation and testing the
dimer stapling model. (A to C) The reference CST 
(cyan) is flanked by four CST complexes—a dihedral 
dimer (opposite, pink) and three tetrameric partners 
(one diagonal neighbor, green, and two adjacent 
neighbors, purple). (A) CTC1 R1175 from the dihedral 


to the reference CTC1 (cyan), with the black dashed 
lines representing feasible ionic interactions between
R1175 and phosphodiester groups of the ssDNA. 

(B and C) Identified intermolecular interactions 
between CTC1, STN1, and TEN1 at interfaces of the
decameric supercomplex. (D) CST dimer stapling 
model with an 18-nt ssDNA molecule. The two 


visual clarity. (E) Changes in CST DNA-binding affinity 
(Kd,apparent) relative to that of 3xTEL with oligo-T
substitution of block A, B, or C of 3xTEL (Scramble 
series). The molecular distance between the TTAGGG 
sequences of blocks A and C was also varied, and 
the impact on CST relative DNA-binding affinity 

was measured (Spacer series). TEL-TEL-TEL oligo is 
also known as 3xTEL. The relative DNA-binding 
affinity values are reported to two significant figures; 
measured values and error analysis are in table S1. 


5′ and 3′ ends of the neighboring ssDNA molecules
in the dihedral CST dimer.
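
To make the design of the test oligonucleotides concrete, the sketch below expands oligo names into sequences, cuts them into 6-nt blocks, and counts the nucleotides separating the first and last TTAGGG module. The naming grammar (TEL for TTAGGG, T6 for six thymines, literal runs of T) is inferred from the oligo names used in this section, and all function names are hypothetical.

```python
TEL = "TTAGGG"  # one human telomeric repeat


def build_oligo(name):
    """Expand a name such as 'T-TEL-TTTT-TEL-T' into its sequence."""
    parts = []
    for token in name.split("-"):
        if token == "TEL":
            parts.append(TEL)
        elif token.startswith("T") and token[1:].isdigit():
            parts.append("T" * int(token[1:]))  # 'T6' means six thymines
        elif set(token) == {"T"}:
            parts.append(token)  # literal run of thymines
        else:
            raise ValueError(f"unknown token: {token!r}")
    return "".join(parts)


def blocks(sequence, size=6):
    """Cut a sequence into consecutive blocks (A, B, C for an 18-mer)."""
    return [sequence[i:i + size] for i in range(0, len(sequence), size)]


def linker_length(sequence):
    """Nucleotides between the first and last TTAGGG (None if fewer than two)."""
    first = sequence.find(TEL)
    last = sequence.rfind(TEL)
    if first == -1 or last == first:
        return None
    return last - (first + len(TEL))


if __name__ == "__main__":
    for name in ("TEL-TEL-TEL", "TEL-T6-TEL", "T-TEL-TTTT-TEL-T",
                 "TT-TEL-TT-TEL-TT", "TTT-TEL-TEL-TTT"):
        seq = build_oligo(name)
        print(name, blocks(seq), "linker:", linker_length(seq))
```

Every oligo in this sketch is 18 nt long, so shortening the linker from 6 to 4, 2, and 0 nt is compensated by thymines added at the ends, which keeps total length constant while the spacing between the two TTAGGG modules varies.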

A comparison of the DNA-free CST model
and the monomeric CST model extracted from
the decameric supercomplex (for example,
compare fig. S1D to fig. S2D) suggested that
STN1c has two alternate docking sites on CTC1.
To investigate this, we turned to a cryo-EM
dataset that had a high population of monomeric
CST with telomeric ssDNA added (fig. S17)
and found two distinct conformations,
one with a “head” density and the other with
an “arm” density, albeit at a lower model resolution
of ~9 Å (Fig. 5, A and B). Because STN1c
is in the “arm” conformation in the decameric
CST, and the STN1c “head” conformation would
sterically hinder formation of the decamer
(by obstructing dihedral dimerization), we
propose that switching from the “head” to the “arm”
docking position for STN1c is an important
first step for CST to form a decameric supercomplex.
STN1c switching is consistent with
our finding that STN1c is less stably bound
to CTC1 than STN1n (fig. S7, B and C).

Surface-charge analysis revealed a highly positively
charged surface on CTC1 OB-G, where
STN1c is expected to dock in the “head”
conformation (Fig. 5C), and similar analysis
revealed a reciprocal patch of high negative
charge on STN1c (Fig. 5C, inset). This suggested
that charge-charge interactions could
mediate the transition from monomeric to
decameric CST, explaining how a longer ssDNA,
with extended binding to the OB-G’s negative
patch, can trigger this transition. The charge-charge
interactions also suggested that increased
salt concentration could mediate the
transition in the absence of ssDNA. Indeed,
we found a large increase in decameric CST
population without addition of ssDNA in a
nonphysiological salt concentration of 800 mM
NaCl (fig. S18).

Finally, we used negative-stain EM single- 
particle analysis to identify subcomplexes of 
the decamer that would give hints to its as- 
sembly pathway(s). We observed two subcom- 
plexes, dimers and tetramers (Fig. 5D), which 
are plausible intermediates in an assembly 
pathway based on ssDNA-stapled dimers such 
as the following: CST assembles first as a di- 
hedral dimer before forming a lateral tetramer 
involving two dihedral dimers, and sequential 
addition of dimers eventually closes the sym- 
metric circle (decamer) by continuing the lat- 
eral oligomerization (Fig. 5E). 


Evidence for higher-order CST assemblies in vivo 


CST monomers interact specifically to form 
the decamer, burying a great deal of exposed 
protein surface area (~2100 Å² per monomer),
and ssDNA has a specific role in triggering 
decamer assembly. These features indicate 


[Fig. 4E table: relative Kd,apparent, with 3xTEL = 1.0]

Scramble series
Oligo              Block A   Block B   Block C   Relative
TEL-TEL-TEL        TTAGGG    TTAGGG    TTAGGG    1.0
T6-TEL-TEL         TTTTTT    TTAGGG    TTAGGG    27
TEL-T6-TEL         TTAGGG    TTTTTT    TTAGGG    3.8
TEL-TEL-T6         TTAGGG    TTAGGG    TTTTTT    39

Spacer series
Oligo              Block A   Block B   Block C   Relative
T-TEL-TTTT-TEL-T   TTTAGG    GTTTTT    TAGGGT    7.9
TT-TEL-TT-TEL-TT   TTTTAG    GGTTTT    AGGGTT    10
TTT-TEL-TEL-TTT    TTTTTA    GGGTTA    GGGTTT    17


that formation of the CST decamer is thermo- 
dynamically favorable and that the monomer 
is built to form the decamer. To confirm that 
CST forms oligomers in cells, we turned to an 
orthogonal epitope tag pull-down approach. 
V5-tagged and FLAG-tagged CTC1 were coex- 
pressed in human embryonic kidney 293T 
(HEK293T) cells, along with STN1 and TEN1. 
Pull-down using anti-FLAG beads immunoprecipitated
V5-CTC1, as well as FLAG-CTC1,
and the reciprocal experiment with anti-V5 
beads similarly recovered the CTC1 with both 
epitope tags (Fig. 5, F and G, and fig. S19, A 
and B, for IP controls). Notably, the pull-down 
result was not sensitive to DNA and RNA de- 
gradation with benzonase (Fig. 5, F and G, and 
fig. S19C), so the higher-order complexes were 
not loosely tethered by nucleic acid. We con- 
clude that a substantial fraction of CST resides 
in a higher-order protein complex, consistent 
with decamers existing in vivo. 


Discussion 


Our structure of the decameric human CST 
supercomplex bound with telomeric ssDNA 
provides the platform for understanding mech- 
anisms of various CST functions in DNA rep- 
lication and DNA damage repair, not only at 
telomeres but also genome-wide (7, 8, 11, 13, 35). 
The atomic-resolution details revealed in this 
structure enabled us to identify amino acids

Fig. 5. Assembly mechanism and pathway model of CST decameric supercomplex. (A) Cryo-EM
densities of two conformations of monomeric CST with the differences indicated by dashed black circles. The
two conformations, “head” (colored gray) and “arm” (colored pink), are assigned as CST without and
with ssDNA bound, respectively. (B) Cartoon models of CST “head” and “arm” conformations depicted by
conformational changes of STN1c docking site on CTC1. The black dashed line represents the unstructured
polypeptide region between STN1n and STN1c. (C) Coulombic surface analysis reveals a highly positively
charged patch on CTC1 OB-G, where STN1c lies when in “head” conformation. Reciprocally, a highly
negatively charged surface is shown on STN1c (see inset). (D) Two-dimensional class averages of negative-stained
CST incubated with 3xTEL ssDNA showed multiple oligomeric species of CST, which are assigned
as monomer, dimer, tetramer, and decamer. (E) Proposed model of assembly pathway of CST decameric
supercomplex upon ssDNA introduction. CST binding of ssDNA prevents STN1c from binding to its original
site (“head” conformation, gray), allowing CST to form dimers before progressing to tetramers, and
eventually leading to a close-ended decameric supercomplex. (F and G) Immunoprecipitation (IP) of orthogonally
tagged CTC1 molecules coexpressed in cells. (F) FLAG IP of HEK293T cell extracts that were
cotransfected with V5-CTC1, FLAG-CTC1, or both, and with TEN1 and STN1. Western blot with antibody
against V5 showed that FLAG-IP of FLAG-CTC1 also coimmunoprecipitated V5-CTC1 (yellow arrows).
(G) Coimmunoprecipitation of FLAG-CTC1 was also observed with V5-IP of V5-CTC1 (yellow arrows). STN1
and TEN1 Western blots were done to determine the presence of CST heterotrimeric complex assembly.

M and M+ indicate protein ladder PageRuler and PageRuler Plus, respectively.


Photoelectron spectra of alkali metal-ammonia
microjets: From blue electrolyte to bronze metal 


Tillmann Buttersack, Philip E. Mason, Ryan S. McMullen, H. Christian Schewe,
Tomas Martinek, Krystof Brezina, Martin Crhan, Axel Gomez, Dennis Hein, Garlef Wartner,
Robert Seidel, Hebatallah Ali, Stephan Thürmer, Ondrej Marsalek, Bernd Winter,
Stephen E. Bradforth, Pavel Jungwirth


Experimental studies of the electronic structure of excess electrons in liquids—archetypal quantum 
solutes—have been largely restricted to very dilute electron concentrations. We overcame this limitation 
by applying soft x-ray photoelectron spectroscopy to characterize excess electrons originating from 
steadily increasing amounts of alkali metals dissolved in refrigerated liquid ammonia microjets. As 
concentration rises, a narrow peak at ~2 electron volts, corresponding to vertical photodetachment of 
localized solvated electrons and dielectrons, transforms continuously into a band with a sharp Fermi 
edge accompanied by a plasmon peak, characteristic of delocalized metallic electrons. Through our 
experimental approach combined with ab initio calculations of localized electrons and dielectrons, we 
obtain a clear picture of the energetics and density of states of the ammoniated electrons over the 
gradual transition from dilute blue electrolytes to concentrated bronze metallic solutions. 


Since the discovery of spectacularly colored
alkali metal-ammonia solutions in
the early 19th century, excess ammoniated
electrons have attracted considerable
attention, as reviewed recently by
Zurek et al. (1) [see Thompson’s classic mono- 
graph (2) for an overview of the older lit- 
erature]. Alkali metals are soluble in liquid 
ammonia up to concentrations of roughly 
20 mol % metal (MPM)—i.e., one metal atom 
per about four solvent molecules (1). A transition
from a blue electrolyte to a bronze- or
gold-colored metallic solution upon increasing 
alkali metal concentration is accompanied by 
a liquid-liquid phase separation at sufficiently 
low temperatures (1-7). The nature of the metallic
transition in both liquid and crystalline
alkali metal-ammonia systems, directly evi- 
denced by an orders-of-magnitude increase in 
electrical conductivity, has puzzled researchers 
for decades (1, 8-10) and is not yet understood
in molecular detail. The involved chemical 
species include dilute solvated electrons and 
dielectrons as well as their various complexes 
with alkali metal cations (7)—all gradually 
coalescing into delocalized structures and 
giving rise to a conduction band. A series of 
conferences on this topic, the Colloques Weyl, 
was organized in the second half of the past 
century, resulting in a series of articles pri- 
marily focused on the structure, thermodynam- 
ics, and electrical and magnetic properties of 
the alkali metal-ammonia solutions (11-16). 
Electrons in liquid ammonia have also been 


thoroughly studied with nuclear magnetic 
resonance (NMR) and electron spin resonance 
(ESR) techniques. The latter show a narrow 
structureless spin resonance line with a g value 
(i.e., dimensionless magnetic moment) charac- 
teristic of a free-electron spin, which broadens 
upon increasing the alkali metal concentration 
(2, 17). Shifts in the ¹H and ¹⁴N NMR positions
(Knight shifts) give a measure of the unpaired 
electron spin density at all constituent nuclei 
within the orbit of the molecules solvating the 
unpaired electron (17, 18). Shkrob has argued 
(19) that the Knight parameters from ¹⁴N NMR,
electron spin echo relaxation, and ESR line- 
width data can only be interpreted as a transfer 
of a substantial fraction of the spin density to 
the nitrogen atoms in the first solvent sphere. 
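
The concentration unit used throughout, mol % metal (MPM), can be turned into a solvent-to-metal ratio with one line of arithmetic. The sketch below assumes the usual mole-percent definition, MPM = 100 × n(metal) / [n(metal) + n(NH3)], which reproduces the "one metal atom per about four solvent molecules" quoted earlier for 20 MPM. The function names are hypothetical.

```python
def solvent_per_metal(mpm):
    """Solvent molecules per metal atom at a given mol % metal (MPM)."""
    if not 0 < mpm < 100:
        raise ValueError("MPM must lie strictly between 0 and 100")
    return (100.0 - mpm) / mpm


def mpm_from_ratio(solvent_molecules_per_metal):
    """Inverse conversion: mol % metal from a solvent-to-metal ratio."""
    if solvent_molecules_per_metal < 0:
        raise ValueError("ratio must be non-negative")
    return 100.0 / (1.0 + solvent_molecules_per_metal)


if __name__ == "__main__":
    for mpm in (20.0, 9.7, 1.0, 0.08):
        print(f"{mpm} MPM -> {solvent_per_metal(mpm):.1f} NH3 per metal atom")
```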

The principal means to explore electronic 
structure, and thus the binding energies and 
density of states of excess ammoniated elec- 
trons, is photoelectron spectroscopy (PES). 
Liquid ammonia has a great advantage over 
water in that high concentrations of ammo- 
niated electrons can be reached in solutions 
that are stable for extended periods of time 
without the danger of explosion (20). Never- 
theless, compared with the number of such 
investigations of electrons solvated in water 
(i.e., hydrated electrons) (21-23), PES studies
in liquid ammonia are scarce. Early photo- 
electron (PE) total emission yield experiments 
led to an estimate of the PE threshold of ~1.4 eV 
(24, 25), in good agreement with electrochem- 
ical determination of the adiabatic binding 


energy of an ammoniated electron (26). This 
value is also roughly consistent with results 
from cluster extrapolations (27-31). However, 
clusters have only limited relevance to liquid 
bulk systems, as they inevitably exhibit pro- 
nounced surface effects and are typically solid 
rather than liquid (32, 33). As a result, struc- 
tures such as metastable clusters exist; these 
structures are characterized by low electron 
binding energies and have no liquid bulk ana- 
log (33). Electron scattering data from clusters 
also differ from condensed phase data (34). 
Additional insight into the ultrafast dynam- 
ics of ammoniated electrons emerged from 
femtosecond time-resolved experiments in- 
volving multiphoton photoionization in pure 
liquid ammonia or photoexcitation in dilute 
alkali metal-ammonia solutions (35-38). These 
studies have typically probed ammoniated 
electrons in the low-concentration regime (i.e., 
individual electrons well below the electrolyte- 
to-metal transition). In concentrated systems, 
plasmons in metallic lithium-ammonia so- 
lutions were explored by x-ray scattering two 
decades ago (10), and a PES study of small to 
medium-sized cryogenic sodium-ammonia 
clusters was performed recently (37). 

There is thus a clear need for a direct PES inves- 
tigation of excess ammoniated electrons that 
would cover both the electrolyte and metallic 
regimes. We have recently overcome a critical 
obstacle in collecting PEs from a volatile polar 
refrigerated liquid. We developed an experi- 
mental setup that produces a liquid ammonia 
microjet and performed PES measurements 
with this apparatus (39). In that study, we char- 
acterized the valence and core orbital struc- 
ture of pure gaseous and liquid ammonia and 
quantified the effect of the condensed phase 
environment on the orbital energies, which was 
found to be even stronger than in water, despite 
weaker hydrogen bonding in liquid ammonia 
(40). This work has paved the way for PES 
investigations mapping the electrolyte-to-metal 
transition through the study of liquid alkali 
metal-ammonia solutions of increasing con- 
centrations, as reported here. 

Electronic structure calculations enable in- 
terpretation of PES measurements of ammoni- 
ated electrons in terms of a complex structural, 
dynamical, and molecular orbital picture. So 
far, only molecular pseudopotential calcu- 
lations have been performed for electrons in 
liquid ammonia (41, 42), with density functional
theory (DFT) applied to crystalline
alkali metal-ammonia systems (8). Although 
the early liquid-state calculations provided
some insight into the transition from individual
solvated electrons through dielectrons
[which exist as spin-paired singlet species 
in liquid ammonia (43)] to the onset of de- 
localized states upon increasing alkali metal 
loading, these findings were inevitably of a 
qualitative nature only. This was due to neg- 
lect of the explicit electronic structure of the 
solvent and to approximations made in the 
pseudopotential itself (33). We have shown 
previously that, in aqueous solutions, a quan- 
titative picture of the electronic structure of 
hydrated electrons and surrounding water 
molecules can be obtained through DFT-based 
ab initio molecular dynamics (AIMD) (44). In 
this study, we employed an extension of this 
approach, combining it with quantum chem- 
ical embedded-cluster evaluation of electron 
binding energies to characterize ammoniated 
electrons and dielectrons. 


Photoelectron spectroscopy: 
Electrolyte solutions 


PE experiments were carried out with the 
SOL³PES experimental setup (45) at the U49/
2-PGM-1 beamline at the synchrotron radia- 
tion facility BESSY II (46) (for details, see the 
“Experimental Methodology” section in the 
supplementary materials). PE spectra at low 
electron binding energies of microjets of 
lithium-liquid ammonia solutions at alkali 
metal concentrations ranging from 0.012 to 
9.7 MPM are presented in Fig. 1A. Analo- 
gous low-energy spectra of potassium-liquid 
ammonia solutions (0.15 to 1.25 MPM) and 
sodium-liquid ammonia (0.15 to 0.75 MPM) 
are shown in Fig. 1, B and C. Visually, the in- 
crease of alkali metal concentration is con- 
nected with deepening of the characteristic 
blue color of the solutions, with the higher 
concentrations becoming nearly black, even 
in the thin microjet, and the solution with 
the highest lithium concentration acquiring 
a discernible bronze-colored metallic sheen. 

Similarly to previously studied aqueous mi- 
crojets (47, 48), in experiments with liquid 
ammonia we observed electrostatic effects 
leading to global PE spectral shifts. These shifts 
are larger for alkali metals in liquid ammo- 
nia than for solutions of alkali halide salts at 
equivalent concentrations (49). To correct for 
these instrumental spectral shifts, the low- 
concentration spectra (0.08 MPM for Li and 
0.15 MPM for Na and K) were aligned hori- 
zontally such that the lowest-energy liquid 
ammonia peak (3a₁), fitted to a Gaussian function,
was always anchored at 9.09 eV, which is
the value of the corresponding vertical de- 
tachment energy (VDE), as determined in 
our recent PE measurements of a pure liquid 
ammonia microjet (39). All other spectra were 
aligned using the same shift. This procedure 
(see the supplementary materials for more 
details) is well justified; for low-to-moderate

Fig. 1. Experimental PE spectra of alkali metal-liquid ammonia solutions at various concentrations,
as obtained by synchrotron x-ray PES in a refrigerated liquid microjet setup. (A) Li in NH3, (B) K in 
NH3, and (C) Na in NH3. Individual data points are color-coded to reflect the actual color of the solutions. 
Energy scales are shown with respect to the vacuum level. (D) Integrated peak areas from 0 to 6 eV in (A) 
(red), (B) (purple), and (C) (yellow), as a function of alkali metal concentration. arb. u., arbitrary units. 


alkali metal concentrations, the effect of ionic 
solutes on the position of the solvent PE peaks 
has been found to be negligible in water [see 
below and (47, 48)]. Nevertheless, these factors 
combine to produce a small systematic uncer- 
tainty in determining absolute values of VDEs, 
which we estimate not to exceed ~0.4 eV. 
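
The alignment step described above amounts to locating the center of the liquid ammonia 3a₁ peak and shifting the energy axis so that this center sits at 9.09 eV. The sketch below illustrates the idea on synthetic data. It is not the fitting procedure used for the measurements: instead of a full least-squares Gaussian fit, it swaps in a three-point parabolic fit to the logarithm of the counts, which is exact only for noise-free Gaussian data on an evenly spaced grid. All names are hypothetical.

```python
import math

REFERENCE_3A1_VDE = 9.09  # eV, liquid ammonia 3a1 value quoted in the text


def gaussian_peak_center(energies, counts):
    """Estimate a peak center from the maximum sample and its two neighbors.

    The log of a Gaussian is a parabola, so a three-point parabolic fit to
    log(counts) recovers the center of noise-free Gaussian data. Counts must
    be positive and the energy grid evenly spaced.
    """
    i = max(range(1, len(counts) - 1), key=lambda k: counts[k])
    left, mid, right = (math.log(counts[k]) for k in (i - 1, i, i + 1))
    step = energies[i + 1] - energies[i]
    return energies[i] + 0.5 * step * (left - right) / (left - 2.0 * mid + right)


def alignment_shift(energies, counts, reference=REFERENCE_3A1_VDE):
    """Shift (eV) that moves the fitted peak center onto the reference."""
    return reference - gaussian_peak_center(energies, counts)


def apply_shift(energies, shift):
    """Return the energy axis after a rigid horizontal shift."""
    return [e + shift for e in energies]


if __name__ == "__main__":
    # Synthetic peak centered at 9.40 eV (illustrative values only).
    grid = [8.0 + 0.05 * k for k in range(61)]
    peak = [math.exp(-0.5 * ((e - 9.40) / 0.6) ** 2) for e in grid]
    print(f"shift = {alignment_shift(grid, peak):+.3f} eV")
```

Because the shift is rigid, applying the same value to every point preserves peak separations, which is why one reference peak is enough to place the whole spectrum on the vacuum-level scale.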

A notable result of the present measure- 
ments is that from ~0.08 to ~1 MPM the PE 
spectra consistently show a small but clearly 
visible peak at a VDE of ~2 eV (Fig. 1). The 
integrated area of this peak is roughly linearly 
proportional to the number concentration of 
the alkali metal (Fig. 1D). The observation that 
the position of this peak essentially does not 
depend on the chemical nature of the alkali
metal points directly to ammoniated electrons.
More precisely, starting from ~10-? MPM, the 
solvated electrons engage in spin-pairing to 
form dielectrons (1, 2, 50). ESR measurements
provide an estimate of the concentration
dependence of the dielectron/electron
ratio (2), which increases with dissolved me- 
tal concentration and reaches a factor of 
~10 around 0.1 MPM. The measured value 
of ~2 eV thus corresponds primarily to the 
VDE of dielectrons. 
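
A short calculation shows why a dielectron/electron ratio of ~10 implies that the ~2 eV signal is dominated by dielectrons. The sketch assumes that the quoted ratio is a number ratio of species (dielectrons to unpaired electrons); this interpretation is our assumption, not a statement from the ESR analysis. The function name is hypothetical.

```python
def fraction_of_electrons_paired(dielectron_to_electron_ratio):
    """Fraction of excess electrons that reside in dielectrons.

    Each dielectron holds two electrons, so a species ratio r corresponds
    to an electron fraction of 2r / (2r + 1).
    """
    r = dielectron_to_electron_ratio
    if r < 0:
        raise ValueError("ratio must be non-negative")
    return 2.0 * r / (2.0 * r + 1.0)


if __name__ == "__main__":
    for ratio in (0.1, 1.0, 10.0):
        share = fraction_of_electrons_paired(ratio)
        print(f"ratio {ratio}: {100.0 * share:.1f}% of excess electrons paired")
```

With a ratio of 10, about 95% of the excess electrons would be spin-paired under this assumption.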


Electronic structure calculations 


Our experimental conclusions are further sup- 
ported by electronic structure calculations. To 
model the structure of electrons, dielectrons,
and electron-alkali cation pairs (fig. S10) in
liquid ammonia, we need to go beyond both
static ab initio calculations of small clusters (51)
and molecular pseudopotential bulk simulations
(41-43). We thus combined state-of-the-art
AIMD using the revPBE0-D3 hybrid density
functional for sampling of relevant structures
with subsequent second-order Møller-Plesset
perturbation theory (MP2) for VDE calcula- 
tions. The latter calculations were performed 
for clusters carved out of the AIMD trajectory 
and embedded in a polarizable continuum 
model (PCM). See the supplementary mate- 
rials for details. 


Fig. 2. Ammoniated electron and 
dielectron simulated by ab initio 
molecular dynamics (AIMD). AIMD 
results for (A) the ammoniated electron 
and (B) the ammoniated dielectron. 
Radial electron density profiles were 
calculated from the squares of the 
corresponding Wannier orbitals 

(green filled curves) and the center 

of excess charge—ammonia nitrogen 
radial distribution functions (RDF) 

(blue curves). Normalization is such 
that the integrated excess electron 
density of the dielectron is twice that 
of the electron (the latter being 
arbitrarily set to peak at the value of 1). 
Dashed vertical lines denote the 
electron or dielectron radius of gyration. 
Inset images depict the squared 
Wannier orbitals with surrounding 
ammonia molecules in the AIMD 


simulation box. e⁻(solv), solvated electron.


Fig. 3. Simulated VDE. An ammo- 
niated electron (red) and dielectron 
(purple) were modeled using 
solvation shells with 12 ammonia 
molecules carved out from 

AIMD simulations and embedded 

in a PCM. For comparison, 

the corresponding experimental 

PE spectra of the low-concentration 
lithium-ammonia solutions

(from Fig. 1A) are shown. 


AIMD simulations of an excess electron in
bulk liquid ammonia demonstrate that an ammoniated
electron occupies a cavity coordinated
by ~12 ammonia molecules and has a
gyration radius of 3.9 Å, on average (Fig. 2A),
consistent with the value of ~3.5 Å from a
moment analysis of the optical absorption
spectra (52). The spin-paired ammoniated
dielectron adopts a structure similar to that
of the electron (Fig. 2B), with approximately
the same number of ammonia molecules in
contact and a slightly larger average gyration
radius of 4.4 Å. In both cases, the solvent shell
is very diffuse and lacks clear separation from 


[Fig. 2 axis: Distance from excess charge center (Å). Fig. 3 legend: Electron, Dielectron, Li 0.08 MPM, Li 0.35 MPM; axes: VDE (eV), PE signal (arb. u.).]


5 June 2020 


the rest of the solvent. Our test AIMD calcu- 
lations show that adding a second electron of 
the same spin leads to the formation of two 
separate solvated electron cavities rather than 
a dielectron in a single cavity. 
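
For readers who want to connect a radial density profile to a gyration radius, the sketch below evaluates Rg² = ∫ r² ρ(r) r² dr / ∫ ρ(r) r² dr for a spherically averaged density sampled on a radial grid. This is a generic textbook definition measured from the center of the excess charge, offered as an illustration and not as the analysis code behind Fig. 2. The test density is a hypothetical 3D Gaussian, for which the exact result is Rg = √3 σ.

```python
import math


def radius_of_gyration(radii, density):
    """Radius of gyration of a spherically averaged density rho(r).

    Uses the trapezoidal rule on both integrals; the 4*pi factors of the
    volume element cancel in the ratio.
    """
    def trapezoid(values):
        return sum(
            0.5 * (values[k] + values[k + 1]) * (radii[k + 1] - radii[k])
            for k in range(len(radii) - 1)
        )

    shell = [rho * r * r for r, rho in zip(radii, density)]      # rho(r) r^2
    second_moment = [s * r * r for r, s in zip(radii, shell)]    # rho(r) r^4
    return math.sqrt(trapezoid(second_moment) / trapezoid(shell))


if __name__ == "__main__":
    sigma = 2.0  # Å, hypothetical width chosen for illustration
    grid = [0.01 * k for k in range(2001)]  # 0 to 20 Å
    rho = [math.exp(-r * r / (2.0 * sigma * sigma)) for r in grid]
    print(f"Rg = {radius_of_gyration(grid, rho):.3f} Å")
```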

The electron solvation structure in ammo- 
nia is qualitatively similar but quantitatively 
different from that of a hydrated electron in 
water or aqueous solution (33, 44). Namely, 
the first solvent shell of the hydrated electron 
is substantially more structured and less dif- 
fuse compared with those of the ammoniated 
electron or dielectron. Moreover, the hydrated 
electron is much smaller, with only four to six 
water molecules in its hydration shell and a 
gyration radius of ~2.5 Å (44, 52). In regard to
dielectrons, the situation in liquid ammonia is 
likely to be different from that in water, where 
hydrated dielectrons are predicted to be thermo- 
dynamically much less stable than hydrated 
electrons (53, 54). 

The AIMD simulations also serve as a basis 
for calculations of the VDE of the ammoni- 
ated electron and dielectron. First, we carved 
out the immediate electron solvation shells 
containing 12 NH3 molecules from more than
100 snapshots from the AIMD trajectories. 
These structures were then embedded in a 
PCM with the dielectric constant of liquid 
ammonia (see below and the supplementary 
materials for more details). The distributions 
of the resulting VDEs of the two species eval- 
uated at the MP2 level (without any additional 
shifts or adjustments) are plotted in Fig. 3, 
referenced against our experimental data. 
The calculated distributions have widths of 
~0.3 eV, peaking at ~2.0 eV for the ammo- 
niated electron and ~1.6 eV for the dielectron. 
In comparison with our low-concentration 
experimental spectra, we see that the exper- 
imental peak at ~2 eV encompasses within its 
width both the calculated solvated electron and 
dielectron VDE distributions (Fig. 3). These 
results are consistent with the previously 
calculated very small difference of ~0.1 eV 
between the lowest optical transitions of an 
ammoniated electron and a spin-paired di- 
electron in an idealized six-coordinated cluster 
geometry (55). The value for the ammoniated 
electron, however, differs quantitatively from 
extrapolations from ammonia clusters with an 
excess electron (29), yielding 1.25 eV. This is 
due to the fact that the cryogenic clusters are 
finite and solid and, therefore, have different 
properties from those of the bulk liquid sys- 
tems described here (56, 57). 


Photoelectron spectroscopy: From electrolytes 
to metallic solutions 


Upon increasing the alkali metal concentra- 
tion, the PE spectra exhibit a gradual conver- 
sion of the Gaussian-type solvated electron peak 
into an asymmetric band with a sharp edge 
toward lower binding energies accompanied
by one or two satellite peaks on the higher-binding
energy side (Figs. 1 and 4, with details
provided in the supplementary materials). At 
the lowest alkali metal concentrations, the 
solvated electron peak can be fitted to a Gaussian 
function (Fig. 4) with a full width at half 
maximum of about 0.45 to 0.6 eV [i.e., slightly 
narrower than the equivalent first ionization 
peak for halides in liquid ammonia (49)] and 
with a low energy onset (appearance potential) 
at ~1.5 eV, which is close to the previous esti- 
mate of 1.4 eV from the PE threshold measure- 
ments (24, 25). 
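
Peak widths here are quoted as full width at half maximum (FWHM), whereas Gaussian functions are usually parametrized by a standard deviation σ. The two are related by FWHM = 2√(2 ln 2) σ ≈ 2.355 σ. The sketch below applies this standard relation; the function names and the example peak position are illustrative only.

```python
import math

# Standard Gaussian relation: FWHM = 2 * sqrt(2 * ln 2) * sigma.
FWHM_TO_SIGMA = 1.0 / (2.0 * math.sqrt(2.0 * math.log(2.0)))


def fwhm_to_sigma(fwhm):
    """Convert a full width at half maximum to a standard deviation."""
    return fwhm * FWHM_TO_SIGMA


def gaussian(x, center, fwhm, height=1.0):
    """Gaussian line shape parametrized by its FWHM."""
    sigma = fwhm_to_sigma(fwhm)
    return height * math.exp(-0.5 * ((x - center) / sigma) ** 2)


if __name__ == "__main__":
    for fwhm in (0.45, 0.6):
        print(f"FWHM {fwhm} eV -> sigma {fwhm_to_sigma(fwhm):.3f} eV")
```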

By contrast, at the highest lithium concentra- 
tion of 9.7 MPM, the PE spectrum is fitted to an 
inverse-parabola conduction band with a sharp 
Fermi edge and two plasmon peaks (Fig. 4), 
as follows directly from the free-electron gas 
model for metals (58), with an effective electron 
mass close to unity (Fig. 4 and Table 1). Owing to 
the relatively low electron density, the plasmon 
frequency is in the visible range, which gives 
the concentrated alkali metal-ammonia so-

Fig. 4. Analysis and fits of the PE spectra in the
electrolyte and metallic regimes. Partial fits 

to the conduction band, plasmons, and localized 
(di)electrons are vertically offset for visual clarity. 
(A) Fit (relative root mean square error of 5.3%) of 
the liquid Na-K alloy to a free-electron gas model. 
At low binding energy, we observe a Fermi edge 
leading feature with the characteristic parabolic 
shape of the conduction band; plasmon excitations 
are seen at higher binding energies. (B) Fits (relative 
root mean square errors of 6.3, 6.8, 7.1, 8.9, and 
32.2% for 9.7, 3.4, 0.97, 0.35, and 0.08 MPM, 
respectively) of Li-NH3 data to a combination, in 
varying ratios, of a free-electron gas model with 
plasmon bands for the fraction where the electron 
is delocalized and a single Gaussian function to 
represent the localized (di)electron. (C) Evolution of
the liquid ammonia 3a₁ peak upon increasing Li
concentration. (D) Concentration dependence of the 
effective electron mass from fits in (A) and (B). 

(E) Relative peak areas corresponding to the
localized Gaussian, the conduction band, and the
plasmon peaks in (B). wrt, with respect to; EF, Fermi
energy; vac, vacuum; m*, effective electron mass;
me, stationary electron mass.


Table 1. Key parameters for lithium-ammonia solutions and the sodium-potassium alloy. 

c, concentration; n_e, electron density; m*, effective electron mass; m_e, stationary electron mass.
Widths of the conduction band (E,) and positions of the plasmon peak (Ep) are expressed with 
respect to the Fermi energy (E-). E, and Ep were determined from fitting to a free-electron gas model 
with the effective electron masses given in the table. The effective electron masses m_* were 
obtained by fitting as described in the supplementary materials. -, not determined. 


Parameter   NaK   Li@NH3   Li@NH3   Li@NH3   Li@NH3   Li@NH3   Li@NH3


c (MPM) 100 97 34 


0.08 0.012 


0.85 


E. (eV) 2.06 0.74 


4.52 2.03 1.67 


lutions their characteristic bronze or gold color 
(7, 59). A conduction band with a Fermi edge 
and a plasmon peak can also be observed for 
the 1.25 MPM potassium-ammonia solution, 
whereas for sodium-ammonia we could not 
prepare homogeneous solutions above ~1 MPM
because of spontaneous phase separation at
the experimental conditions (1, 2).

An analogous fit to a free-electron gas model 
is shown for a microjet PE spectrum of liquid 
50/50 sodium-potassium metal alloy (Fig. 4A
and Table 1). Here, the conduction band is 
wider and the fundamental plasmon exci- 
tation seen at higher binding energies has a 
higher frequency (~4.5 eV), as expected for 
the higher electron density in the metal alloy 
compared with the metallic lithium-ammonia 
solutions. The higher frequency places the 
plasmon in the ultraviolet range when consid- 
ering the optical reflectance of the sodium- 
potassium alloy, which does not exhibit any 
color and has a metallic silver sheen. This is 
actually true for all alkali metals except cesium, 
in which the lower free-electron density shifts 
the plasmon frequency to the visible range, con- 
ferring a golden color (59). 
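
The density dependence invoked above can be illustrated with the textbook expressions for an ideal free-electron gas. This is a hedged sketch, not the fitting model used for Fig. 4: the function names are hypothetical, the formula is the bulk plasma frequency without any dielectric screening or surface correction, and the example density in the usage note is a textbook value for sodium rather than a number from this study.

```python
import math

HBAR = 1.054571817e-34       # reduced Planck constant, J s
M_E = 9.1093837015e-31       # stationary electron mass, kg
E_CHARGE = 1.602176634e-19   # elementary charge, C
EPS0 = 8.8541878128e-12      # vacuum permittivity, F/m


def fermi_energy_ev(n_e: float, m_eff: float = 1.0) -> float:
    """Conduction band width (Fermi energy, eV) of a free-electron gas.

    n_e is the electron number density in m^-3; m_eff is m*/m_e.
    """
    k_f = (3.0 * math.pi ** 2 * n_e) ** (1.0 / 3.0)
    return HBAR ** 2 * k_f ** 2 / (2.0 * m_eff * M_E) / E_CHARGE


def plasmon_energy_ev(n_e: float, m_eff: float = 1.0) -> float:
    """Bulk plasmon energy (eV) of a free-electron gas with density n_e (m^-3)."""
    omega_p = math.sqrt(n_e * E_CHARGE ** 2 / (EPS0 * m_eff * M_E))
    return HBAR * omega_p / E_CHARGE
```

With an assumed density of 2.65e28 m^-3, the functions return a band width near 3.2 eV and a plasmon energy near 6 eV. The plasmon energy scales with the square root of the density, which is why a more dilute electron gas moves the plasmon from the ultraviolet toward the visible range.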

The PE spectra of the ~1 to 4 MPM lithium- 
liquid ammonia solutions can be fitted by a 
combination of a localized Gaussian at 2 eV 
with a conduction band and plasmon follow- 
ing from a free-electron gas model, albeit with 
a reduced effective electron mass (Fig. 4D and 
Table 1). As the lithium concentration increases, 
the relative weight of the Gaussian contribu- 
tion to the spectrum decreases such that, at 
9.7 MPM, it practically vanishes (Fig. 4E). At 
the same time, the spectra exhibit changes 
in the shape and position of the liquid ammonia
3a₁ peak upon buildup of the metallic
behavior of the solution (Fig. 4C). Specifically,
in the electrolyte regime the position of the
3a₁ peak barely changes, but it does
tend to broaden and move to lower bind- 
ing energies upon appearance of the metallic 
state (for more details, see the supplementary 
materials). 

The above results suggest that, in accord 
with the previous view (1), the electrolyte- 
to-metal transition upon increasing the metal 
concentration in alkali metal-ammonia solu- 
tions is not a sharp phase transition but rather 
a gradual conversion that resembles a perco- 
lation process, with an unresolved question 
concerning the sizes of potentially coexisting 
microscopic regions supporting localized and 
delocalized electrons (2, 35, 60, 61). This is a 
different picture than that of a sharp transi- 
tion at ~8 MPM drawn from recent PE spectra 
of alkali metal-ammonia nanodroplets (37). 
Although such experiments are pioneering in 
their own right, it is reasonable to question 
whether clusters of finite size are representa- 
tive of bulk metallic solutions in their electronic 
structure. The cluster PE spectra exhibit no- 
table differences, such as the lack of a sharp 
Fermi edge, the absence of plasmon peaks, 
and Fermi edge onset at higher rather than 
lower binding energies from the onset of the 
localized (di)electron peak (37). All of these factors
suggest a qualitatively different transition
in the nanodroplets, taking place at substantially
higher concentrations, from those previously
determined for bulk liquid systems (1, 37).
Our present bulk liquid PES results show a 
Fig. 5. Picturing the emergence of band formation
during the transformation from electrolyte
(blue) to metallic solution (bronze or gold) 
upon increasing alkali metal concentration. The 
limiting PE spectra are schematically represented 
at the top and bottom of the figure, with axes 
showing binding energies with respect to the Fermi 
and vacuum level, respectively. For a free-electron 
gas (upper concentration limit), the width of the 
leading conduction band is expected to scale with 
electron number density, with the plasmon bands 
(of energy Ep) simultaneously moving away from the 
leading edge. 


buildup of a conduction band with a Fermi 
edge with increasing alkali metal concentra- 
tion even before the solution becomes visibly 
metallic (Fig. 4). This picture is also in accord 
with the semiquantitative Mott’s criterion, 
which postulates that a metallic state starts to 
appear when the mean distance between the 
electrons drops below approximately four times 
their size (62). With a ~4-Å radius of gyration
of the ammoniated electrons and dielectrons 
(Fig. 2), metallic behavior should begin to evolve 
at ~1 MPM, which is consistent with the onset 
of conduction band formation in the present 
PES measurements. Note, however, that the 
transition observed in this study is more grad- 
ual than what would strictly follow from a pure 
Mott's transition (62). 
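
The criterion as stated above (mean electron spacing below roughly four times the electron size) can be written as a short check. This is a minimal sketch under stated assumptions: the function names are hypothetical, the mean spacing is approximated as n^(-1/3), and converting a number density to MPM would additionally require the solution density, which is not given here.

```python
def mean_electron_spacing(n_e: float) -> float:
    """Approximate mean distance between electrons (m) for number density n_e (m^-3)."""
    return n_e ** (-1.0 / 3.0)


def mott_critical_density(size_m: float, ratio: float = 4.0) -> float:
    """Density (m^-3) at which the mean spacing equals ratio * size_m."""
    return (ratio * size_m) ** -3


def is_metallic(n_e: float, size_m: float, ratio: float = 4.0) -> bool:
    """True when the mean spacing has dropped below ratio * size_m."""
    return mean_electron_spacing(n_e) < ratio * size_m
```

For a 4-Å size, `mott_critical_density(4e-10)` is about 2.4e26 electrons per cubic meter, corresponding to a mean spacing of 16 Å.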

We can thus conclude that the occurrence in 
the PE spectrum of a conduction band with a 
distinct Fermi edge, together with plasmon 
peaks, is a signature of the electrolyte-to-metal 
transition. This gradual transition is observed 
in both the lithium-ammonia and potassium- 
ammonia solutions (see Figs. 1 and 4 and the 
supplementary materials). [As mentioned 
above, at concentrations exceeding ~1.5 MPM, 
sodium-ammonia solutions phase-separate 
into immiscible electrolyte and metallic phases 
(2), which compromises the microjet PE mea- 
surements.] One can also view the process from 
the other side, i.e., as a metal-to-electrolyte
transition upon decreasing the alkali metal 
concentration. We see from Fig. 4 that, at the 
highest studied concentration of 9.7 MPM, 
the metallic lithium-ammonia system behaves 
similarly to an ideal free-electron gas, as does 
the liquid sodium-potassium alloy or a pure 
alkali metal (63, 64). However, upon decreasing 
the concentration of the alkali metal-ammonia 
solutions below ~4 MPM, we observe depar- 
ture from the ideal electron gas model, as ex- 
emplified by a rapid decrease of the effective 
electron mass well below the value of 1 m_e
(where m_e is the mass of a stationary electron);
see Fig. 4D. A schematic is presented
in Fig. 5 to capture the essence of the transi- 
tion. This image depicts the gradual intercon- 
version between localized “chemical” species 
(solvated electrons and dielectrons) and de- 
localized “physical” moieties (metallic con- 
duction band electrons) upon changing the 
electron concentration. 


Outlook 


The present study shows that the electrolyte- 
to-metal transition in increasingly concen- 
trated alkali metal-liquid ammonia solutions 
is a gradual process rather than an abrupt 
first-order transition, which is in line with 
previous suggestions (7). From the molecular 
point of view, this transition may be under- 
stood in a simplified way as gradual coalescence 
of individual solvated electrons and dielectrons 
upon increasing alkali metal doping, with the 
metallic behavior appearing around the per- 
colation threshold. 

After overcoming methodological difficulties 
connected to modeling the onset of the me- 
tallic state, future AIMD simulations of concen- 
trated alkali metal-liquid ammonia solutions 
will shed more light on the electrolyte-to-metal 
transition in terms of the underlying electronic 
structure and molecular geometries. On the ex- 
perimental side, the experience already gained 
from studies of liquid ammonia microjets is 
proving essential in our current attempts to 
achieve the metallic state in the much more 
reactive (even explosive) alkali metal-water 
systems. 
WEARABLE DEVICES


Giant thermopower of ionic gelatin near 


room temperature 


Cheng-Gong Han*, Xin Qian*, Qikai Li*, Biao Deng, Yongbin Zhu, Zhijia Han, Wenqing Zhang,
Weichao Wang, Shien-Ping Feng, Gang Chen†, Weishu Liu†


Harvesting heat from the environment into electricity has the potential to power Internet-of-things (IoT)
sensors, freeing them from cables or batteries and thus making them especially useful for wearable
devices. We demonstrate a giant positive thermopower of 17.0 millivolts per degree Kelvin in a flexible,
quasi-solid-state, ionic thermoelectric material using synergistic thermodiffusion and thermogalvanic
effects. The ionic thermoelectric material is a gelatin matrix modulated with ion providers (KCl, NaCl,
and KNO₃) for thermodiffusion effect and a redox couple [Fe(CN)₆⁴⁻/Fe(CN)₆³⁻] for thermogalvanic
effect. A proof-of-concept wearable device consisting of 25 unipolar elements generated more
than 2 volts and a peak power of 5 microwatts using body heat. This ionic gelatin shows promise for
environmental heat-to-electric energy conversion using ions as energy carriers.


The need to power Internet-of-things (IoT) sensors without using cables or batteries has spurred intense research on energy harvesting from the environment. One approach is thermoelectric energy conversion technology, which is based on the Seebeck effect and uses widely available waste heat to meet the power demands of IoT sensors from microwatts to megawatts (1, 2). Conventional electronic-thermoelectric (e-TE) materials are usually narrow-bandgap semiconductors that use electrons or holes as energy carriers. For a typical thermoelectric material, the thermopower (or Seebeck coefficient) is ~100 to 200 μV K⁻¹. As a result, generating a useful voltage of 1 to 5 V in a room temperature environment requires either the challenging integration of thousands or even tens of thousands of tiny, ~50-μm thermoelectric elements (3) or a DC-DC voltage booster to increase the voltage of a regular-sized device with millimeter legs up to 100 times (4).
An alternative route for direct energy har- 
vesting from low-grade heat was reported in 
ionic systems by exploring two very different 
mechanisms. One mechanism is based on 
redox reactions at two electrodes maintained at 
two different temperatures. Devices using this 
mechanism are called thermogalvanic cells 
(5, 6). The other mechanism is ionic thermodiffusion
under a temperature gradient without
redox reaction, also known as the Soret effect 
(7, 8). Electricity can be generated continuously 
based on the thermogalvanic mechanism as 
the redox reactants are rebalanced by ionic diffusion (9). Thermodiffusion cells operate in a capacitive mode (10). After a temperature difference establishes a voltage difference, the charges stored on the electrodes can be discharged to an external load. The temperature gradient is then removed for the system to recover and reapplied for the next cycle. Most research is
based on either the thermogalvanic or the 
thermodiffusion cell configuration. For thermogalvanic cells, liquid electrolytes with redox couples such as cobalt(II/III) tris(bipyridyl) (11, 12), iron(II/III) (13), iodide/triiodide (14, 15), and ferro/ferricyanide [Fe(CN)₆⁴⁻/Fe(CN)₆³⁻] (9, 16-23) were reported to have an absolute temperature coefficient of a few millivolts per degree Kelvin. For example, one of the highest negative temperature coefficients of −4.2 mV K⁻¹ was realized in an aqueous system using the Fe(CN)₆⁴⁻/Fe(CN)₆³⁻ redox couple and chaotropic guanidinium salts (22). For the thermodiffusion cell configuration, a thermopower of +11 mV K⁻¹ was obtained using NaOH in a polyethylene oxide (PEO) solution (10). Liquid cells,
however, have a drawback for use in wearable 
devices because of the challenges of encapsula- 
tion (24-26). Quasi-solid-state electrolytes have 
gained attention as an alternative (27-29). A 
temperature coefficient of −1.09 and −1.21 mV K⁻¹
was observed when using Fe(CN)₆⁴⁻/Fe(CN)₆³⁻
as a redox couple in the poly(sodium acry- 
late) and polyvinyl alcohol matrix, respectively 
(27, 28), a value lower than that of the redox 
couple in liquid solutions. High thermodiffusive 
thermopower is observed in the quasi-solid- 
state polymer gel composite of poly(vinylidene 
fluoride-co-hexafluoropropylene) (PVDF-HFP) 
and polyethylene glycol (PEG) with ionic liquid as charge carriers, and the thermopower is tunable from −4 to 14 mV K⁻¹ by tailoring the composition (29). Furthermore, a thermopower as high as +24 mV K⁻¹ was reported by using the high ionic selectivity of the NaOH-PEO aqueous solution in the confined nanocellulosic channels, such that Na⁺ is the major mobile
ion (30). However, whether the thermodif- 
fusion effect and thermogalvanic effect work 
together synergistically to boost the final 
thermopower in a single ionic thermoelectric 
(i-TE) system remains an open question be- 
cause of their fundamentally different physi- 
cal pictures. 
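
The element-count argument made earlier in this paragraph follows from the series voltage V = N·|S|·ΔT. The sketch below is illustrative only: the function name is hypothetical, and the 5 K temperature difference in the usage note is an assumed value, not one reported in the text.

```python
import math


def elements_needed(target_voltage: float, thermopower: float, delta_t: float) -> int:
    """Number of series elements required to reach target_voltage (V).

    thermopower is the magnitude of the Seebeck coefficient in V/K and
    delta_t is the temperature difference across each element in K.
    """
    if thermopower <= 0 or delta_t <= 0:
        raise ValueError("thermopower and delta_t must be positive")
    exact = target_voltage / (thermopower * delta_t)
    # Small guard so that floating-point noise does not add a spurious element.
    return max(1, math.ceil(exact - 1e-9))
```

With an assumed ΔT of 5 K, `elements_needed(5.0, 100e-6, 5.0)` gives 10000 elements for an electronic thermoelectric material, whereas `elements_needed(2.0, 17e-3, 5.0)` gives 24 for a material with a thermopower of 17 mV K⁻¹.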

We combined thermogalvanic and thermo- 
diffusion effects to achieve high thermopower. 
Before moving on, however, it is necessary 
to clarify our terminologies because the litera- 
ture has created some confusion. Similar to 
conventional e-TE materials, the thermodiffusive thermopower (or Seebeck coefficient) of ions is defined as $S_{\mathrm{td}} = -\frac{V(T_H) - V(T_C)}{T_H - T_C}$, where $V(T_H)$ and $V(T_C)$ correspond to the voltage of the hot electrode at temperature $T_H$ and the cold electrode at temperature $T_C$, respectively. We clarify later that the sign of $S_{\mathrm{td}}$ is determined by the type of charge with higher thermal mobility in a solution, and thus is a transport property. In electrochemistry, the temperature dependence of the standard electrode potential for a reduction reaction ($E^0$) at the isothermal condition is referred to as the “temperature coefficient,” $\alpha_R = dE^0/dT$ (31, 32), where $\alpha_R$ is a thermodynamic property. For a redox reaction O + ne⁻ ⇌ R, where the oxidized species O is converted into the reduced species R with n moles of electrons (e⁻) transferred per unit mole of reaction, the temperature coefficient is $\alpha_R = (S_R - S_O)/(nF)$, where $S_O$ and $S_R$ are partial molar entropies of the species O and R, respectively, and F is the Faraday constant. In a thermogalvanic cell under a temperature gradient, the redox reaction contribution to the measured voltage is $V(T_H) - V(T_C) = \alpha_R (T_H - T_C)$, which means that the sign of $\alpha_R$ is opposite to the sign convention of the Seebeck coefficient (33). In addition to the redox contributions, the thermodiffusion of redox species under a temperature gradient also contributes to the total voltage, which is usually negligible (~10 μV K⁻¹) in aqueous solutions (33). We report a giant thermopower of 17.0 mV K⁻¹ in a quasi-solid-state i-TE material by combining the thermodiffusion effect of KCl and the temperature coefficient of a Fe(CN)₆⁴⁻/Fe(CN)₆³⁻ redox couple. The general strategy is to use a negative temperature coefficient (i.e., $\alpha_R < 0$) and a p-type thermodiffusive thermopower (i.e., $S_{\mathrm{td}} > 0$) to generate a high differential thermal voltage $S_i$. Using such materials, a high output voltage of 2.2 V is achieved using body heat in a wearable and flexible i-TE device with only 25 unipolar elements in series working in a quasicontinuous mode. The proof-of-concept device demonstrates promising application of ionic gelatin in powering wearable IoT applications.
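
The two sign conventions defined above are easy to confuse, so a small numerical sketch may help. The function names are hypothetical, and the entropy difference used in the usage note (−135 J mol⁻¹ K⁻¹) is an assumed illustrative value, not a number taken from this work.

```python
FARADAY = 96485.33212  # Faraday constant, C/mol


def seebeck_from_voltages(v_hot: float, v_cold: float, t_hot: float, t_cold: float) -> float:
    """Thermopower (V/K) with the Seebeck sign convention: S = -(V_H - V_C) / (T_H - T_C)."""
    return -(v_hot - v_cold) / (t_hot - t_cold)


def temperature_coefficient(s_reduced: float, s_oxidized: float, n_electrons: int = 1) -> float:
    """Temperature coefficient alpha_R = (S_R - S_O) / (n F) in V/K.

    Entropies are partial molar entropies in J mol^-1 K^-1.
    """
    return (s_reduced - s_oxidized) / (n_electrons * FARADAY)


def thermogalvanic_voltage(alpha_r: float, t_hot: float, t_cold: float) -> float:
    """Redox contribution to V(T_H) - V(T_C), in volts."""
    return alpha_r * (t_hot - t_cold)
```

With an assumed entropy difference of −135 J mol⁻¹ K⁻¹, `temperature_coefficient(-135.0, 0.0)` is about −1.4 mV K⁻¹. The resulting hot-minus-cold voltage is negative, so passing it through `seebeck_from_voltages` yields a positive (p-type) contribution equal to −α_R, which is why a negative temperature coefficient adds to a positive thermodiffusive thermopower.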


Giant thermopower of i-TE materials 


We denote the as-fabricated i-TE materials as Gelatin-x MX-m/n FeCN⁴⁻/³⁻ (MX = KCl, NaCl, KNO₃), where x and m/n are the molar concentrations of MX and K₄Fe(CN)₆/K₃Fe(CN)₆, respectively, in which Fe(CN)₆⁴⁻/Fe(CN)₆³⁻ serves as the redox couple (hereafter abbreviated as FeCN⁴⁻/³⁻) and the ion provider MX further boosts the thermodiffusive thermopower. We chose organic gelatin for the matrix because of its abundance, low cost, high biocompatibility, and mechanical flexibility. We found that thermodiffusion of ionic species under a temperature gradient, together with the thermogalvanic effect of the redox couple FeCN⁴⁻/³⁻, contributes to the high thermopower of i-TE materials of Gelatin-x KCl-m/n FeCN⁴⁻/³⁻. We observed an improved thermopower from 4.8 to 12.7 mV K⁻¹ by increasing the concentration of KCl from x = 0 M to x = 0.8 M in the as-fabricated Gelatin-x KCl-0.42/0.25 M FeCN⁴⁻/³⁻ (Fig. 1A and fig. S1). We achieved a further improved thermopower from 12.7 to 17.0 mV K⁻¹ by tailoring the volume ratio of water to gelatin (Fig. 1A). This value is much higher than those of other reported gel-based i-TE materials that use either a thermodiffusion effect or a thermogalvanic effect (Fig. 1B and table S1).

The thermodiffusion of KCl in gelatin showed a p-type thermopower. We then used the FeCN⁴⁻/³⁻ redox couple, which has a negative temperature coefficient, to achieve a synergistic effect. Because $\alpha_R$ is related to the entropy change of the reduction reaction, the negative temperature coefficient $\alpha_R = (S_{\mathrm{FeCN}^{4-}} - S_{\mathrm{FeCN}^{3-}})/F < 0$ indicates that FeCN⁴⁻ has lower solvation entropy than FeCN³⁻, which is consistent with the solvation shell being more tightly packed around FeCN⁴⁻ because of its higher valence charge (35). At the hot electrode, the oxidation reaction FeCN⁴⁻ → e⁻ + FeCN³⁻ is thermodynamically favorable and injects electrons into the hot electrode, increasing its electrochemical potential (i.e., lower voltage) and generating a thermopower (33) that is consistent with the thermodiffusion contributions of KCl. At the cold side, the reduction reaction FeCN³⁻ + e⁻ → FeCN⁴⁻ is thermodynamically favored, with electrons attracted from the electrode, resulting in a decreased electrochemical potential (i.e., higher voltage). The redox couple therefore works synergistically with the thermodiffusion of KCl to achieve the high p-type thermopower in the as-fabricated i-TE materials of Gelatin-x KCl-m/n FeCN⁴⁻/³⁻.


Optimization of thermopower 


The optimization of the as-fabricated i-TE materials of Gelatin-x MX-m/n FeCN⁴⁻/³⁻ involved tuning of the concentration of the ion providers (MX = KCl, KNO₃, and NaCl), the redox couple (FeCN⁴⁻/³⁻), and the volume ratio of water to gelatin. We obtained a thermopower of 1.4 mV K⁻¹ from $V(T_C) - V(T_H)$ and $T_H - T_C$ measurements (fig. S1) for the FeCN⁴⁻/³⁻ redox couple in an aqueous electrolyte with Cu foils as the symmetric electrodes (Cu | aqueous FeCN⁴⁻/³⁻ | Cu). Our measurements were in good agreement with the previously reported value (1.4 mV K⁻¹) (9). We observed a leap in thermopower from 1.4 to 4.8 mV K⁻¹ in Gelatin-FeCN⁴⁻/³⁻ (x = 0 M, m/n = 0.42/0.25 M) compared with the pristine FeCN⁴⁻/³⁻ solution (fig. S1). The pure gelatin had a reference thermopower of 1.3 mV K⁻¹ caused by the thermodiffusion of H⁺ from the ionization of carboxyl groups -COOH (36), whereas the

Fig. 1. Giant thermopower of i-TE materials. (A) Comparison of the thermopower among the as-fabricated i-TE materials of Gelatin-x KCl-m/n FeCN⁴⁻/³⁻ (x is KCl and m/n are K₄Fe(CN)₆/K₃Fe(CN)₆ molar concentrations, respectively) in this work as Gelatin (x = 0 M, m/n = 0 M), Gelatin-FeCN⁴⁻/³⁻ (x = 0 M, m/n = 0.42/0.25 M), Gelatin-KCl (x = 0.8 M, m/n = 0 M), and Gelatin-KCl-FeCN⁴⁻/³⁻ (x = 0.8 M, m/n = 0.42/0.25 M, volume ratio of water to gelatin r_v = 2.0 and 3.0). (B) Absolute thermopower of i-TE materials containing the thermodiffusion effect or the thermogalvanic effect (table S1). The filled and unfilled columns represent p-type and n-type thermopower, respectively. (C) Thermopower of i-TE materials of Gelatin-x KCl, Gelatin-x KNO₃, and Gelatin-x NaCl with varying concentrations of KCl, KNO₃, and NaCl. (D) Thermopower of i-TE materials of Gelatin-x KCl, with varying pH values tuned by HCl and KOH, respectively. (E) Thermopower of i-TE materials of Gelatin-x KCl-m/n FeCN⁴⁻/³⁻ with the fixed x = 0.8 M. (F) Thermopower with the dependence of volume ratio of water to gelatin for Gelatin-0.8 M KCl-0.42/0.25 M FeCN⁴⁻/³⁻. r_v = 2.0 was maintained in (C) to (E).

Gelatin-0.42 M FeCN⁴⁻ and Gelatin-0.25 M
FeCN³⁻ had a thermopower of 1.2 and 1.0 mV
K⁻¹ (fig. S2A), respectively.

We investigated the thermodiffusion effect of ion providers by comparing three series of gelatin-based i-TE materials: Gelatin-x KCl, Gelatin-x KNO₃, and Gelatin-x NaCl with x = 0, 0.1, 0.3, 0.5, 0.8, and 1 M, respectively (Fig. 1C and fig. S3). Gelatin-x KCl had an increased thermopower from 4.3 mV K⁻¹ to a peak value of 6.7 mV K⁻¹ as the concentration of KCl increased from x = 0.3 to 0.8 M, and then a decline when the concentration of KCl increased further. The Gelatin-x NaCl also had a similar peak thermopower of ~6.7 mV K⁻¹ but at the concentration of x = 0.3 M. The Gelatin-x KNO₃ had a lower peak thermopower of ~3 to 4 mV K⁻¹ in the range of x = 0.5 to 0.8 M.

Theoretically, the contribution to the thermo- 
power of mobile cations and anions in i-TE 
materials could be analogous to the multiband 
transport in e-TE materials. The temperature 
gradient drove both cations and anions to 
migrate across the device from the hot side 
to the cold side, resulting in a net charge ac- 
cumulation and an internal electric field that 
generated voltage. We derived the total thermo- 
diffusive thermopower of a symmetrical electro- 
lyte, such as Gelatin-x KCl, based on the Onsager 
transport theory as follows (33): 


$$S_{\mathrm{td}} = \frac{D_+\hat{S}_+ - D_-\hat{S}_-}{e\,(D_+ + D_-)} \qquad (1)$$

where the subscript “+” or “−” denotes the ion species, e is the elementary charge, D and $\hat{S}$ are the mass diffusion coefficient and the Eastman entropy of transfer, respectively, and $\hat{S}$ is essentially the temperature dependence of the free energy dG/dT, which is related to the interaction between solutes and the surrounding media (37). Cations and anions in i-TE materials are equal so that the ionic thermodiffusion is ambipolar, a difference from e-TE materials. Analogous to the Einstein relation for diffusion driven by a concentration gradient, thermal mobility can also be defined as $D\hat{S}/k_B T$ (33). The positive thermopower suggested that the thermal mobility ($D_+\hat{S}_+/k_B T$) of cation K⁺ was larger than that ($D_-\hat{S}_-/k_B T$) of anion Cl⁻. The ionic interactions induced by negatively charged gelatin network could generate a larger Eastman entropy of transfer $\hat{S}_+$, which might be
responsible for the large p-type thermodiffusive 
thermopower. Alternatively, the complicated 
relation between the diffusion coefficient and 
the concentration in the matrix with a charged 
polymer network may also be responsible. Ex- 
periments (38) and computational analysis 
(39, 40) have shown that in a negatively charged 
polymer network, the cations have a higher 
diffusion coefficient. A small fraction of cations tend to “condensate” along negatively charged polymer chains. This “counterion condensation” was proposed by Manning (41). These immobilized K⁺ condensed near the polymer could further impose frictional drags on Cl⁻, which would reduce the mobility of Cl⁻. However, the rest of K⁺ not condensed around the polymer backbones remains more mobile compared with the Cl⁻ that was dragged by the condensed immobile K⁺. We observed that the thermopower is concentration dependent. As the concentration increases, the fraction of mobile cations increases compared with the condensed cations (39). Further increasing the concentration could decrease the Debye length of the electrical double layer and induce a screening effect of the ionic coupling between the ions and gelatin, so that the thermal mobility of ions tends to converge to that of pure KCl solution, which has a negligibly small thermopower measured to be ~40 μV K⁻¹ (34). This tradeoff could explain the existence of maximum thermopower in Gelatin-x KCl and Gelatin-x KNO₃. The lower thermopower of Gelatin-x KNO₃ compared with Gelatin-x KCl can also be attributed to the smaller difference between the thermal mobility of K⁺ and NO₃⁻. NO₃⁻ is a stronger water-structure breaker compared with Cl⁻, resulting in a higher mass diffusion coefficient $D_-$ (42), which is consistent with the ionic conductivity measurement (fig. S4). Thus, the NO₃⁻ cancels more thermopower than Cl⁻ in the as-fabricated gelatin-based i-TE materials. Moreover, we found that the pH values affected the thermopower of the i-TE material, i.e., Gelatin-x KCl (x = 0.8 M) (Fig. 1D and fig. S5, A and B), because of the ionization of gelatin functional groups (-COOH), which could affect the ion-gelatin interaction and effectively change the Eastman entropy of transfer of ions. We observed an optimized thermopower of 6.7 mV K⁻¹ at pH = 7.0. Additionally, we investigated the Gelatin-x K₂SO₄ (x = 0.25, 0.40, and 0.50 M) with divalent anions (fig. S5C). Among the investigated concentrations, the Gelatin-x K₂SO₄ (x = 0.40 M) showed the highest thermopower at 4.9 mV K⁻¹, which is much less than the 6.7 mV K⁻¹ of Gelatin-x KCl (x = 0.8 M). Adding FeCN⁴⁻/³⁻ into the Gelatin-x KCl system makes the thermopower sensitive to the concentration of the FeCN⁴⁻/³⁻ redox couple. With x fixed at 0.8 M, the thermopower varied from 6.7 mV K⁻¹ to 8.3, 10.4, 12.7, and 7.7 mV K⁻¹ as m/n changed from 0/0 M to 0.08/0.05, 0.25/0.15, 0.42/0.25, and 0.50/0.30 M, respectively (Fig. 1E and fig. S2B). We repeatedly observed the highest thermopower of 12.7 mV K⁻¹ in the as-fabricated i-TE material of Gelatin-0.8 M KCl-0.42/0.25 M FeCN⁴⁻/³⁻ (fig. S6). We measured a lower thermopower of Gelatin-0.8 M KCl-m/n FeCN⁴⁻/³⁻ with m/n = 0.25/0.25 M (11.0 mV K⁻¹) and 0.42/0.42 M (7.3 mV K⁻¹) compared with m/n = 0.42/0.25 M (fig. S2C). We attribute the high thermopower to the synergy of the thermogalvanic effect of redox couple FeCN⁴⁻/³⁻ and the thermodiffusion effect of the mobile ions. Additionally, the thermal conductivity of the i-TE material Gelatin-0.8 M KCl-0.42/0.25 M FeCN⁴⁻/³⁻ is low (0.15 W m⁻¹ K⁻¹ at 293 K), allowing it to maintain a temperature difference for power generation (fig. S7) (33). We observed excellent reversibility of the redox reaction evidenced by the overlapped peaks scanned for three cycles in CV curves (fig. S8). We observed the anodic and cathodic peaks from 0.05 to 0.28 V and −0.05 to −0.28 V (versus Pt), respectively, in the CV curves of the Gelatin-0.8 M KCl-m/n FeCN⁴⁻/³⁻ (fig. S9A). We found increasing redox peak potential ($E_p$) and current density with increasing m/n values (fig. S9B). Additionally, the oxidized species (FeCN³⁻) generated at the hot side and the reduced species (FeCN⁴⁻) generated at the cold side migrated to the other electrode under a concentration gradient, making continuous current output possible (9, 43).
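
Equation 1 and the thermal-mobility definition given earlier in this section can be evaluated directly. The sketch below is illustrative: the function names are hypothetical, the Eastman entropies are expressed per ion in J K⁻¹, and the numerical inputs in the usage note are invented placeholders rather than measured values for gelatin.

```python
E_CHARGE = 1.602176634e-19  # elementary charge, C
K_B = 1.380649e-23          # Boltzmann constant, J/K


def thermodiffusive_thermopower(d_plus: float, s_plus: float,
                                d_minus: float, s_minus: float) -> float:
    """Eq. 1: S_td = (D+ S+ - D- S-) / (e (D+ + D-)), in V/K.

    d_plus, d_minus: mass diffusion coefficients (m^2/s).
    s_plus, s_minus: Eastman entropies of transfer per ion (J/K).
    """
    return (d_plus * s_plus - d_minus * s_minus) / (E_CHARGE * (d_plus + d_minus))


def thermal_mobility(d: float, s_hat: float, temperature: float) -> float:
    """Thermal mobility D * S_hat / (k_B T), in m^2 s^-1 K^-1."""
    return d * s_hat / (K_B * temperature)
```

For placeholder inputs D₊ = 2e-9 m² s⁻¹, Ŝ₊ = 1e-21 J K⁻¹, D₋ = 1e-9 m² s⁻¹, and Ŝ₋ = 0.5e-21 J K⁻¹, the cation has the larger thermal mobility and the function returns a positive (p-type) thermopower of about 3.1 mV K⁻¹. Equal products D₊Ŝ₊ and D₋Ŝ₋ cancel exactly, which mirrors the statement that a faster anion cancels more of the thermopower.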

The water/gelatin volume ratio (r_v) also boosted the thermopower of the as-fabricated Gelatin-0.8 M KCl-0.42/0.25 M FeCN⁴⁻/³⁻ system. The water in the gelatin matrix provides the diffusion channel for ions in the quasi-solid-state i-TE material, affecting the thermopower (Fig. 1F and fig. S10). We varied r_v values and observed a continuous increase from 12.7 to 17.0 mV K⁻¹ as r_v increased from 2.0 to 3.0. Increasing r_v further to 3.3 decreased the thermopower to 14.1 mV K⁻¹ (Fig. 1F). Higher r_v also reduced the fracture strain and stretchability. We fixed r_v at 2.0 for device demonstration.


Mechanism of synergistic effect 


This section explains the synergy between the 
thermodiffusion and thermogalvanic effects 
(Fig. 2, A to C). The thermodiffusion of KCl accumulated positive net charges near the cold electrode, generating an electric field pointing from the cold electrode to the hot electrode (Fig. 2A). This generated a thermodiffusive voltage $\Delta V_{\mathrm{td}} = V(T_H) - V(T_C) < 0$. Because of its higher solvation entropy, more FeCN³⁻ than FeCN⁴⁻ is generated at higher T (35) through oxidation. This transfers electrons to the hot electrode and increases its electrochemical potential ($\tilde{\mu}$). At the cold electrode, FeCN⁴⁻ generation was promoted and extracted electrons from the electrode. The T gradient drives thermodiffusion and balances the redox reaction. Consequently, the thermogalvanic effect shifts the $\tilde{\mu}$ of both electrodes in the same direction as the thermodiffusion effect. The thermogalvanic voltage that we measured was the difference in standard electrode potential $\Delta E^0 = \alpha_R (T_H - T_C) < 0$, which has the same sign as the thermodiffusive voltage. The FeCN⁴⁻/³⁻ also participated in thermodiffusion and contributed to the final thermopower.
From the Onsager transport formulation, we see how a large, positive thermopower comes from the coupling of the thermodiffusion and thermogalvanic effects (33). We observed that the total thermopower $S_i$ could be written as a summation:

$$S_i = -\alpha_R + S_{\mathrm{td}}(\mathrm{K^+\text{-}FeCN^{4-/3-}}) + S_{\mathrm{td}}(\mathrm{KCl}) + S_{\mathrm{td}}(\mathrm{gelatin}) \qquad (2)$$

where $-\alpha_R$ is the contribution to thermopower from the redox reaction FeCN³⁻ + e⁻ ⇌ FeCN⁴⁻, $S_{\mathrm{td}}$ is the thermopower from the thermodiffusion of mobile ions, and $S_{\mathrm{td}}(\mathrm{gelatin})$ is the intrinsic thermopower of the gelatin. We used an isothermal three-electrode system (Fig. 2D) to effectively eliminate the T gradient and determine the temperature coefficient (33). The contribution from the redox

Fig. 2. Mechanism of the synergistic effect. Diagrams of the electrochemical potential ($\tilde{\mu}$) of charge carriers and the corresponding voltage (V) distribution of i-TE material of Gelatin-x KCl-m/n FeCN⁴⁻/³⁻ as (A) Gelatin-KCl (x = 0.8 M, m/n = 0 M), where E represents the built-in electric field; (B) Gelatin-FeCN⁴⁻/³⁻ (x = 0 M, m/n = 0.42/0.25 M); and (C) Gelatin-0.8 M KCl-0.42/0.25 M FeCN⁴⁻/³⁻. (D) Isothermal system of Gelatin-FeCN⁴⁻/³⁻ for measuring the entropy of FeCN⁴⁻/³⁻. The work electrode (WE) was platinum, whereas SCE was used as the reference electrode (RE) and counter electrode (CE). (E) Thermopower caused by redox entropy change of FeCN⁴⁻/³⁻ ($-\alpha_R$) measured from (D) and total value ($S_i$). (F) Schematic figure of the diffusion, redox reaction, and interaction of the ions in the as-fabricated i-TE materials of Gelatin-x KCl-m/n FeCN⁴⁻/³⁻ under the temperature gradient. (G) Fractional contribution to thermopower of i-TE material Gelatin-0.8 M KCl-0.42/0.25 M FeCN⁴⁻/³⁻ (r_v = 2.0).


Han et al., Science 368, 1091-1098 (2020) 5 June 2020 4 of 7 


RESEARCH | RESEARCH ARTICLE 


couple was finally determined to be −α_R = 2.27 mV K⁻¹
(Fig. 2E and fig. S11) by compensating for the temperature
coefficient of the saturated calomel electrode (SCE) (33).
Figure 2F shows a schematic illustration of the partial
contributions to the thermopower in a complex system
containing K⁺, Cl⁻, FeCN⁴⁻/³⁻, and water, as well as a gelatin
molecule structure. The relative contributions to the total
thermopower in Gelatin-0.8 M KCl-0.42/0.25 M FeCN⁴⁻/³⁻
(r_v = 2.0) are determined as follows (see Fig. 2G): 10.2%
from gelatin, 17.9% from the redox entropy of FeCN⁴⁻/³⁻, 9.7%
from thermodiffusion of K₃Fe(CN)₆ and K₄Fe(CN)₆, and 62.2%
from the thermodiffusion effect of KCl (33). We conducted
additional experiments by switching the direction of the
temperature difference between the two electrodes, and
observed a hysteresis showing the dynamical response of the
device to the transient temperature field (figs. S12 and S13)
(33).
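
The percent shares quoted above can be turned into absolute contributions with a few lines of arithmetic. The following is a minimal sketch, not the authors' analysis: it assumes that the 17.9% redox share corresponds to the measured 2.27 mV K⁻¹, and the function and dictionary names are illustrative.

```python
# Percent contributions to the total thermopower quoted in the text (Fig. 2G).
CONTRIBUTIONS_PCT = {
    "gelatin": 10.2,
    "redox entropy of FeCN(4-/3-)": 17.9,
    "thermodiffusion of K3Fe(CN)6 and K4Fe(CN)6": 9.7,
    "thermodiffusion of KCl": 62.2,
}

# Redox contribution (-alpha_R) from the isothermal measurement, in mV/K.
REDOX_MV_PER_K = 2.27


def implied_total_thermopower(part_mv_per_k, part_pct):
    """Total thermopower implied by one absolute part and its percent share."""
    return part_mv_per_k / (part_pct / 100.0)


def absolute_contributions(total_mv_per_k, shares_pct):
    """Convert percent shares into absolute contributions in mV/K."""
    return {name: total_mv_per_k * pct / 100.0 for name, pct in shares_pct.items()}


if __name__ == "__main__":
    total = implied_total_thermopower(
        REDOX_MV_PER_K, CONTRIBUTIONS_PCT["redox entropy of FeCN(4-/3-)"]
    )
    print(f"implied total thermopower: {total:.2f} mV/K")
    for name, value in absolute_contributions(total, CONTRIBUTIONS_PCT).items():
        print(f"  {name}: {value:.2f} mV/K")
```

Under this assumption the implied total is close to the lower end of the 12.7 to 17.0 mV K⁻¹ range reported in the Summary, and the KCl thermodiffusion term dominates.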


Working modes of an i-TE cell 


A thermodiffusion effect-based i-TE cell is essentially
capacitive (10, 30) because the discharge current is
nonfaradaic and no electrons are transported across the
electrode-electrolyte interfaces. A thermogalvanic cell works
in a continuous manner, with redox couples reacting in
opposite directions on the hot and cold electrodes and ionic
diffusion supplying the reactants to electrode surfaces, thus
ensuring continuous operation (9). We demonstrate a
quasicontinuous working mode by using the i-TE material of
Gelatin-0.8 M KCl-0.42/0.25 M FeCN⁴⁻/³⁻ (r_v = 2.0). We
assembled the i-TE cell in a laminar structure of Cu | Au |
i-TE | Au | Cu (15 × 15 × 1.8 mm). We maintained the cold side
at 293 K and the hot side at 301.5 K (ΔT = 8.5 K). The
as-fabricated i-TE
cell was charged in ~55 min to reach a high 
(near-saturation) voltage. We then stepped it 
into the quasicontinuous working mode. The 
cell discharged to 0 V in 10 s by connecting 
to an external circuit with a current linearly 
ramped up from 0 to maximum, and then 
recovered back to the high voltage in 3 min 
in open circuit under the same applied tem- 
perature difference. In the discharge process, 
the electrons flowed from the hot side to the 
cold side through the external circuit, result- 
ing in a decreased internal electrostatic field 
and hence the cell voltage. The discharging 
current is also a synergistic result of redox 
couples and ion providers, contributed partially by the
faradaic process caused by the redox couple FeCN⁴⁻/³⁻ and the
capacitive desorption of K⁺ and Cl⁻. Once the external
circuit is disconnected, the diffusion of the 
redox couple resupplies the consumed spe- 
cies to the electrode and the concentration 
profile of ion providers reestablishes, so the 
cell voltage recovers, allowing for the next 
discharge cycle (fig. S14). We completed 100 of these
charge-discharge cycles (Fig. 3A) over a time span of 5 hours.
The corresponding power curve of the fifth cycle displayed
parabolic behavior with the maximum at 8 μW (Fig. 3B). We
expect that such quasicontinuous operation can last much
longer until the electrodes are fully polarized (33). Output
power decreased as the quasicontinuous cycle number increased
(Fig. 3C, inset), which was probably caused by the
polarization of the electrodes. To solve this issue, we
reactivated the i-TE cell by removing the temperature
difference and totally cooling down the cell while
short-circuiting the electrodes.
The reactivated cell recovered the voltage 
and current (fig. S15). The concentrations of 
all ionic species redistributed and the electrodes were
depolarized after this process (33). We reproducibly achieved
high thermally charged voltage over several consecutive days
(fig. S16). This demonstrates that the cell can be used
repetitively rather than being a one-time energy source. We
reduced the thermal charge time from 3 min to ~20 s by
increasing the number of layers of the i-TE cell from one to
three (Cu | i-TE | Cu | i-TE | Cu | i-TE | Cu, 15 × 15 × 1.8 mm).
The internal electrodes shortened the distance over which ions
must diffuse, and hence shortened the thermal charging process
(fig. S17). We coated the Cu foils (10 μm thickness) with Au
(40 nm) because electrode corrosion is a performance concern
and found a
Fig. 3. Working mode of an i-TE cell. (A) Quasicontinuous
thermal charge/electrical discharge process for an i-TE cell
measured for 100 cycles [Cu | Au | i-TE | Au | Cu, 15 × 15 ×
1.8 mm, Au (40 nm)-coated rough Cu foils]. (B) Power (line),
voltage (dashed line), and output current (dashed and dotted
line) curves of the discharge process at the fifth cycle in
(A). (C) Corresponding total energy density of the initial 50
cycles for i-TE cells with rough Cu | Au (40 nm) and smooth Cu
as electrodes. Normalized output power P_max/(ΔT)² and maximum
output current of 100 cycles in an i-TE cell (Cu | Au | i-TE |
Au | Cu, 15 × 15 × 1.8 mm) are shown in the inset.
(D) Continuous thermal charge/electrical discharge process for
the i-TE cell [Cu | Au | i-TE | Au | Cu, 15 × 15 × 1.8 mm, Au
(40 nm)-coated rough Cu foils] at the external resistor
R = 5000 Ω and ΔT = 8 K. (E) Power of the continuous discharge
process at the different external resistors and ΔT ~ 8 K. The
inset shows the measurement circuit. (F) Corresponding energy
density at the different external resistors. The energy was
calculated by the integration of power over time (1 hour)
shown in (E).
comparable thermopower (fig. S18). However, the total energy
density of the initial 50 cycles was much higher (7.4 J m⁻²)
than for Cu foil electrodes (1.5 J m⁻²) (Fig. 3C and fig. S19).
The Au (40 nm)-coated Cu foil electrode has an enlarged
surface area (fig. S20) (21, 44). We also measured a slightly
(8%) higher thermopower using a Pt electrode compared with the
Cu foil electrode (fig. S21). Electrode optimization may boost
the output power density of a gelatin-based i-TE cell. We
calculated the specific pulsed power density, P_max/(ΔT)² =
V_oc I_sc/(2ΔT)², where V_oc and I_sc are the open-circuit
voltage and short-circuit current, respectively. We measured
the maximum output power density at 0.66 mW m⁻² K⁻², which is
one to two orders of magnitude higher than previously reported
in a gel-based i-TE cell (Fig. 3C and table S2). We also
operated the as-fabricated i-TE cell with Au-coated copper
electrodes in continuous working mode. We initially thermally
charged the cell at ΔT = 8 K to reach a near-saturated voltage
and then electrically discharged at the
Fig. 4. Proof-of-concept of wearable i-TE device. (A) Tensile
test of the i-TE material of Gelatin-0.8 M KCl-0.42/0.25 M
FeCN⁴⁻/³⁻ (r_v = 2.0) compared with pure gelatin. (B) Voltage
generated from a proof-of-concept flexible i-TE wearable device
with 25 unipolar elements (Cu | i-TE | Cu, 5 × 5 × 1.8 mm,
smooth Cu foil) in series worn on the back of the human hand.
(C) Power (line), voltage (dashed line), and output current
(dashed and dotted line) curves of
same ΔT with a constant external resistance of 5000 Ω
(Fig. 3D). The output voltage and output power (Fig. 3E)
initially decayed rapidly but then settled to a constant value
with the external resistor, reaching steady-state
thermogalvanic operation mode. We calculated the energy
density (Fig. 3F) for a range of external resistance values,
which showed parabolic behavior with a maximum of 12.8 J m⁻².
This value is higher than that in the quasicontinuous working
mode (Fig. 3C).
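
The specific pulsed power density defined above, P_max/(ΔT)² = V_oc I_sc/(2ΔT)², follows from the peak power of a source with a linear current-voltage curve, P_max = V_oc I_sc/4. The sketch below illustrates that relation only. The function names are illustrative, the division by electrode area is an assumption made here to obtain W m⁻² K⁻², and the example inputs are hypothetical rather than measured values.

```python
def max_power_linear_source(v_oc, i_sc):
    """Peak power of a source whose voltage falls linearly from v_oc (at I = 0)
    to zero (at I = i_sc): P(I) = v_oc * (1 - I / i_sc) * I peaks at I = i_sc / 2."""
    return v_oc * i_sc / 4.0


def specific_pulsed_power(v_oc, i_sc, delta_t):
    """P_max / (dT)^2 = v_oc * i_sc / (2 * dT)^2, in W K^-2."""
    return v_oc * i_sc / (2.0 * delta_t) ** 2


def specific_pulsed_power_density(v_oc, i_sc, delta_t, area_m2):
    """Same quantity per unit electrode area, in W m^-2 K^-2.
    Normalizing by the electrode area is an assumption of this sketch."""
    return specific_pulsed_power(v_oc, i_sc, delta_t) / area_m2


if __name__ == "__main__":
    # Hypothetical inputs for illustration only (not values from the article).
    v_oc, i_sc, delta_t = 0.10, 2.0e-4, 8.5   # V, A, K
    area = 15e-3 * 15e-3                      # m^2, a 15 x 15 mm electrode
    print(max_power_linear_source(v_oc, i_sc))
    print(specific_pulsed_power_density(v_oc, i_sc, delta_t, area))
```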


Proof-of-concept wearable i-TE device 


An ionic liquid in a polymer gel i-TE cell based on
thermodiffusion was demonstrated by Zhao et al. and achieved a
device thermopower of 0.33 V K⁻¹. This device combined 18 pairs
of n- and p-type elements (29). Using 25 p-type unipolar
elements allowed us to reach a comparable device thermopower.
Our i-TE materials are highly flexible and suitable for
wearable electronics applications (fig. S22). After bending
the Gelatin-0.8 M KCl-0.42/0.25 M FeCN⁴⁻/³⁻ material 5000
times, it had similar values of the voltage and output power
density (fig. S23). The addition of KCl and FeCN⁴⁻/³⁻ may
improve the stretchability: we strained the material to 200%,
compared with a 140% strain for the pure gelatin (fig. S24).
The as-fabricated i-TE materials of Gelatin-0.8 M
KCl-0.42/0.25 M FeCN⁴⁻/³⁻ (r_v = 2.0) remained intact after
stretching from 3 to 7.2 cm and recovered after release
(Fig. 4A).

The giant thermopower of the as-fabricated ionic gelatin i-TE
materials (Gelatin-x KCl-m/n FeCN⁴⁻/³⁻) provides a promising
solution for the voltage needed for IoT sensors in a near-room
temperature environment. We constructed a flexible and
wearable i-TE device assembled by serially connecting 25 i-TE
elements using copper-only electrodes (Fig. 4B). This device
can be worn on the back of the hand (Fig. 4B, inset). We
obtained a voltage of 2.2 V in a cold environment (ΔT of
~10 K). The voltage generated by our device is enough to drive
different sensors without additional DC-DC


[Fig. 4 panel labels: Discharge; Temperature (K); Time (min); Wearable device; Quasi-solid state i-TE; Output power (μW)]

the proof-of-concept wearable i-TE device by harvesting real
body heat. (D) Performance comparison in output voltage and
power of the wearable device using e-TE materials and
quasi-solid-state i-TE materials worn on a real human body.
N represents the number of the n- or p-type thermoelectric
elements in the wearable devices. The i-TE material used was
Gelatin-0.8 M KCl-0.42/0.25 M FeCN⁴⁻/³⁻ (r_v = 2.0).
voltage boosters, e.g., humidity sensors (1.6 to 3.6 V),
pressure sensors (1.5 to 3.6 V), and gas sensors for
monitoring indoor air quality (1.8 to 3.6 V). We measured
current-voltage-output power (I-V-P) curves of our 25-element
assembled i-TE device (Fig. 4C). We obtained a pulsed output
power of 5.0 μW and a closed-circuit current of 8.5 μA in a
10-s discharge process, corresponding to an electrical energy
of 3.5 × 10⁻⁵ J after a single thermal charge. This harvested
energy is enough for powering many commercial sensors, e.g.,
0.7 × 10⁻⁶ J for the digital temperature sensor (Si705x,
Silicon Laboratories; operating voltage 1.9 to 3.6 V, 195 nA
average current at a 1-Hz sample rate) and 1.1 × 10⁻⁶ J for
the low-power humidity sensor (HDC2010, Texas Instruments;
operating voltage 1.6 to 3.6 V, 0.3 μA average current at a
1-Hz sample rate). We compared the output voltage and power of
our i-TE wearable device with other reported i-TE and e-TE
devices that use human body heat (Fig. 4D); the output of our
as-fabricated i-TE wearable device was two to three times that
of some previously reported i-TE devices (28, 45) and two
orders of magnitude higher than that of e-TE devices (46-48).
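
The energy figures in this paragraph can be cross-checked with simple arithmetic. The sketch below rests on two assumptions that are not stated in the text: that the pulsed power follows an idealized parabola over the 10-s discharge, and that each sensor draws its average current at the upper operating voltage of 3.6 V for one second per sample. All function names are illustrative.

```python
def parabolic_pulse_energy(p_max_w, duration_s):
    """Energy of a pulse with P(t) = 4 * p_max * s * (1 - s), s = t / duration.
    Integrating over the pulse gives (2/3) * p_max * duration."""
    return 2.0 / 3.0 * p_max_w * duration_s


def sensor_energy(voltage_v, avg_current_a, seconds=1.0):
    """Energy drawn by a sensor at a given supply voltage and average current."""
    return voltage_v * avg_current_a * seconds


def samples_supported(harvested_j, energy_per_sample_j):
    """Whole number of samples that one harvested pulse could power."""
    return int(harvested_j // energy_per_sample_j)


if __name__ == "__main__":
    pulse = parabolic_pulse_energy(5.0e-6, 10.0)   # idealized estimate, J
    temperature = sensor_energy(3.6, 195e-9)       # Si705x, assumed 3.6 V supply
    humidity = sensor_energy(3.6, 0.3e-6)          # HDC2010, assumed 3.6 V supply
    print(f"idealized pulse energy: {pulse:.2e} J")
    print(f"temperature sensor per sample: {temperature:.2e} J")
    print(f"humidity sensor per sample: {humidity:.2e} J")
    print(samples_supported(3.5e-5, temperature), samples_supported(3.5e-5, humidity))
```

The idealized pulse estimate lands close to the quoted discharge energy, and the per-sample sensor energies match the quoted values to within rounding, which is why one thermal charge covers tens of 1-Hz samples under these assumptions.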


Summary 


We have demonstrated a giant thermoelectric effect in an ionic
gelatin-based i-TE material, Gelatin-x KCl-m/n FeCN⁴⁻/³⁻, which
synergistically combines the thermogalvanic effect of a redox
couple (FeCN⁴⁻/³⁻) and the thermodiffusion effect of ion
providers. A high positive thermopower of 12.7 to 17.0 mV K⁻¹
was achieved by comprehensively optimizing the concentration
of KCl (x = 0.8 M) and the FeCN⁴⁻/³⁻ (m/n = 0.42/0.25 M) and
water ratio. A proof-of-concept flexible i-TE wearable device
with 25 p-type elements shows a high voltage up to 2.2 V, and
a pulsed output power of 5.0 μW with a total output energy of
3.5 × 10⁻⁵ J is extracted in a single discharge process by
using the real heat of the human body with ΔT ~10 K, enough to
power many IoT sensors. The generated voltage is two to three
times higher than those of previously reported i-TE devices.
The as-fabricated i-TE cell can
work in a quasicontinuous thermal charge/ 
electrical discharge mode for long-time usage, 
but can also work in continuous mode, delivering
a maximum energy density of 12.8 J m⁻².
This work provides a promising approach to 
realizing cable- and battery-free energy supplies 
for IoT sensors, demonstrating the promise 
of using ions as the energy carriers in thermo- 
electric energy conversion. 
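
As a rough consistency check on the device figures in this Summary, the ideal thermopower of a series stack is the element thermopower multiplied by the number of elements. The sketch below is an idealized estimate, assuming every element sees the full temperature difference and contact losses are ignored. The function names are illustrative.

```python
def series_thermopower(n_elements, element_mv_per_k):
    """Ideal device thermopower (V/K) of n identical elements in series."""
    return n_elements * element_mv_per_k / 1000.0


def effective_thermopower(voltage_v, delta_t_k):
    """Device thermopower (V/K) inferred from a measured voltage and delta T."""
    return voltage_v / delta_t_k


if __name__ == "__main__":
    low = series_thermopower(25, 12.7)    # lower end of the reported range
    high = series_thermopower(25, 17.0)   # upper end of the reported range
    worn = effective_thermopower(2.2, 10.0)
    print(f"ideal 25-element stack: {low:.3f} to {high:.3f} V/K")
    print(f"inferred from 2.2 V at ~10 K: {worn:.2f} V/K")
```

The value inferred from the worn device is lower than the ideal stack estimate, as would be expected if part of the nominal temperature difference is lost outside the i-TE material.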
Iridium-catalyzed acid-assisted asymmetric
hydrogenation of oximes to hydroxylamines


Josep Mas-Roselló, Tomas Smejkal, Nicolai Cramer*


Asymmetric hydrogenations are among the most practical methods for the synthesis of
chiral building blocks at industrial scale. The selective reduction of an oxime to the
corresponding chiral hydroxylamine derivative remains a challenging variant because of
undesired cleavage of the weak nitrogen-oxygen bond. We report a robust cyclometalated
iridium(III) complex bearing a chiral cyclopentadienyl ligand as an efficient catalyst for this
reaction operating under highly acidic conditions. Valuable N-alkoxy amines can be accessed
at room temperature with no detectable overreduction of the N-O bond. Catalyst turnover
numbers up to 4000 and enantiomeric ratios up to 98:2 are observed. The findings serve
as a blueprint for the development of metal-catalyzed enantioselective hydrogenations
of challenging substrates.


Asymmetric hydrogenation with homogeneous transition metal
catalysts is one of the most efficient methods for the
preparation of single enantiomers at industrial scale (1, 2).
The enormous progress in highly enantioselective reduction of
prochiral olefins and ketones is tightly linked 
to the development of new chiral ligand archi- 
tectures (3, 4). In the context of chiral amine 
synthesis, the catalytic asymmetric hydrogen- 
ation of imine or enamine precursors is a well- 
established method and is frequently used 
industrially (5). In stark contrast, related metal- 
catalyzed hydrogenations of oximes to pro- 
duce chiral hydroxylamines have proven elusive 
(Fig. 1B). These substrates are often inert, 
and when reactivity is observed, undesired re- 
ductive cleavage of the labile N—O bond leads 
to primary amines (6). Therefore, the devel- 
opment of a complementary homogeneous 
hydrogenation mode is required. 

The N-alkoxy amine group is an increasingly 
common motif in agrochemicals and pharma- 
ceuticals, with the N—O bond offering favorable 
physical and biological properties (7). Com- 
pared to the related, more abundant, chiral 
amine moieties in drugs (8), current bioactive 
N—O compounds either lack chirality or are 
marketed as racemates (Fig. 1A). A practical 
asymmetric synthesis would facilitate incor- 
poration of chiral three-dimensional hydrox- 
ylamine scaffolds as design elements in drug 
discovery (9). So far, only the use of substoi- 
chiometric to stoichiometric amounts of chiral 
oxazaborolidine borane adducts was shown to yield
hydroxylamine products in an enantioselective fashion.
However, those reactions also suffer from undesirable primary
amine by-products, depending on the oxime structure (10, 11)
(Fig. 1C). Moreover, costs and waste build-up make this method
difficult to scale. Here, we present cyclometalated chiral
iridium(III) complexes bearing a chiral cyclopentadienyl
ligand (Fig. 1D, purple) and an achiral aryl imine C,N-chelate
(Fig. 1D, green). We apply them for the enantioselective
hydrogenation of protonated oximes to hydroxylamine
derivatives, showcasing their potential in asymmetric
catalysis. Related C,N-chelated half-sandwich Ir(III)
complexes (12) have already found diverse applications in
catalysis, including hydrogenation, dehydrogenation,
oxidation, and hydrofunctionalization, among other
transformations (13-15).

Preliminary studies revealed that cyclometalated Cp*-iridium
complex Ir1 (Fig. 2) engages in highly efficient homogeneous
oxime hydrogenations in the presence of stoichiometric amounts
of a strong Brønsted acid (16). The reaction is fully
chemoselective toward reduction of the C=N bond of the oxime,
showing no reductive cleavage of the N-O bond. The required
acid assistance in the reaction suggests an ionic
hydrogenation mechanism (fig. S1), whereby a protonated
substrate receives a hydride from a metal complex via an
outer-sphere mechanism (17). The enantiodetermining
facial-selective hydride delivery to the non- 
coordinated substrate is often the slowest step 
(18). This is distinct from classical homoge- 
neous hydrogenation, where the substrate is 
bound to a metal center and subsequently 
receives the two hydrogen atoms from the 
same catalyst entity (19). The strong Brønsted acid fulfils a
triple role: (i) The oxime substrate (~5 orders of magnitude
less basic than an imine) is protonated, activating it toward
hydride addition; (ii) the conjugate base dissociates from
iridium, facilitating dihydrogen coordination and its
subsequent heterolytic cleavage into a proton and a hydride
source (20); and (iii) N-protonation of the basic
hydroxylamine product prevents catalyst poisoning (Fig. 1D).
The need for stoichiometric amounts of a strong acid severely
complicates the use of chiral proton sources (e.g., chiral
phosphoric acids). We hypothesized that chiral
cyclopentadienyl (Cpˣ) ligands, which have emerged as powerful
ligands for transition metal-catalyzed C-H functionalizations
(21), constitute a potential entry point for enantioselective
oxime hydrogenation. Although transient cyclometalated species
of Cpˣ metal complexes are frequent intermediates in C-H
functionalizations (22), their use as stable cyclometalated
complexes for catalytic purposes has been far less explored.

Oxime substrates 1 were typically obtained as
E/Z-diastereomeric mixtures. When required (see below),
separation by silica gel column chromatography delivered the
pure bench-stable E and Z isomers. Air- and moisture-stable
iridium(III) complexes Ir1 to Ir4 were accessed in a
straightforward two-step sequence. Their subsequent evaluation
as selective catalysts in the reduction of oxime E-1a to
N-tert-butoxylamine 2a is summarized in Fig. 2 and fig. S2.
The hydrogenation tests were conducted with 1 mol % of the
iridium complex, 1.5 equivalents of methanesulfonic acid
(MsOH), and 50 bar of H₂ at 23°C in 2-propanol (16). Exposure
of E-1a to achiral complex Ir1, bearing an acetophenone imine
as the lower chelate portion, resulted in quantitative
formation of 2a with no detected overreduction by nuclear
magnetic resonance (NMR) analysis. Using (S)-Ir2, where the
Cp* unit was replaced by our chiral binaphthyl-derived Cpˣ
ligand (23), gave (S)-2a in 41% yield and 70:30 enantiomeric
ratio (e.r.). Encouraged by this proof of principle, we
tailored the lower chelating C,N-ligand architecture for the
transformation, resulting in Ir3. Additional 3,5-dimethyl
groups on the aniline unit and a cyclic rigid tetralone
backbone with an ethylene glycol ether adjacent to the iridium
boosted the catalyst performance, giving 2a in >99% yield and
improved 89:11 e.r. In particular, the proximal
(2-methoxyethyl) ether substituent rendered the complex more
robust toward deactivation. In addition, the oxygen atoms of
the tail might engage in hydrogen-bonding interactions with
the substrate (24). The enantioselectivity of the
hydrogenation was further improved by retaining the optimal
C,N-ligand and tuning the capping chiral Cpˣ ligand, resulting
in Ir4, which has the Cpˣ methoxy units replaced by phenyl
groups (25). Using Ir4 as the precatalyst produced 2a in >99%
yield and
Fig. 1. Relevance of hydroxylamines and strategies for their enantioselective
production by oxime reductions. (A) Marketed biologically active compounds
containing N-O bonds either lack chirality or are sold as racemates. (B) Metal-catalyzed
asymmetric oxime hydrogenations lead to undesired primary amine
products. (C) Asymmetric reductions with stoichiometric chiral oxazaborolidine
reagents proceed with partial cleavage of the N-O bond, are expensive, and
are difficult to scale up. (D) Iridium(III) complexes enable the enantioselective
hydroxylamine synthesis via the developed asymmetric hydrogenation platform.


94:6 e.r. ¹H-NMR analysis of Ir4 in tetrahydrofuran (THF)-d₈
revealed a 70:30 diastereomeric mixture provided by the
iridium stereogenic center. Upon in situ hydride
incorporation, complex Ir4-H was detected in 96:4
diastereomeric ratio (d.r.) (fig. S3). This likely represents
the ceiling of achievable selectivity. X-ray crystallographic
analysis of related complex Ir3b-I provided additional
structural insights of the complex in the solid state
(fig. S4).

The hydrogenation proceeds in a variety of solvents. However,
the catalyst reactivity and selectivity were lower in aprotic
solvents such as toluene and THF (Fig. 2, entries 4 and 5).
Among alcoholic solvents, tert-amyl alcohol (tAmylOH) was
superior, giving excellent levels of enantioselectivity
(97.5:2.5 e.r.) and conversion (98%) for the reduction of E-1a
to 2a (entry 3), whereas the reduction of Z-1a under otherwise
identical conditions resulted in modest conversion and modest
enantioselectivity (entry 7). In line with previous reports
(26), this indicates a strong impact of the oxime E/Z
stereochemistry on the reaction outcome, with E-1a providing
superior reactivity and selectivity. Because protonated oximes
can isomerize by nucleophilic addition/elimination to the C=N
double bond, the choice of the alcoholic solvent and the
Brønsted acid is key to achieving high selectivity by tuning
the substrate E/Z equilibration rate versus the hydrogenation
rate. Pure E-1a isomer equilibrates to an 80:20 E/Z-mixture in
the absence of Ir4 (entry 6). Using the catalyst in tAmylOH
produced 2a in 97.5:2.5 e.r., irrespective of the reaction
time (4 or 20 hours); this indicates that the hydrogenation is
faster than oxime isomerization (entries 3 and 9). Performing
the reduction in methanol (MeOH), which triggers a faster
oxime isomerization, caused a drop in selectivity to 80:20
e.r. (entry 2). In contrast, 1.0 equivalents of MsOH, the
minimum required amount of acid, in tAmylOH afforded 2a in
98:2 e.r., although this was accompanied by incomplete
conversion of E-1a (entry 8). 2,2,2-Trifluoroethanol disrupted
essential interactions for stereocontrol and yielded 2a in
modest 54:46 e.r. (entry 1). Additional experiments showing
the impact of the acid strength and stoichiometry on the
reaction outcome are summarized in fig. S4. A high
enantioselectivity
was maintained at 0.25 mol % catalyst loading (entry 10) or
with lower hydrogen pressure (1 bar H₂) (entry 11). With lower
loadings, the risk of catalyst deactivation by chloride anion
contamination increases. The chloride analog of (S)-Ir4 was
completely catalytically incompetent as a result of the high
Cl-Ir binding affinity (entry 12). The absolute configuration
of 2a was confirmed by single-crystal x-ray analysis of its
4-nitrobenzenesulfonic acid salt.
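
The selectivities in this section are given as enantiomeric ratios (e.r.). For readers more used to enantiomeric excess, the conversion is ee = (major − minor)/(major + minor). The helper below is a minimal sketch with illustrative function names. A negative result simply means the second enantiomer in the ratio predominates.

```python
def parse_er(er):
    """Split an enantiomeric ratio string such as '97.5:2.5' into two floats."""
    first, second = (float(part) for part in er.split(":"))
    return first, second


def ee_from_er(er):
    """Enantiomeric excess (%) of the first-listed enantiomer."""
    first, second = parse_er(er)
    return 100.0 * (first - second) / (first + second)


if __name__ == "__main__":
    # Ratios quoted in the optimization discussion above.
    for ratio in ("54:46", "80:20", "97.5:2.5", "98:2"):
        print(f"{ratio} e.r. corresponds to {ee_from_er(ratio):.1f}% ee")
```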

Using the optimized conditions with a slow E/Z-oxime
equilibration regime, the acid-assisted enantioselective
reduction was applicable to a variety of oximes 1, producing
the corresponding alkoxy amines 2 in excellent yields and
enantioselectivity (Fig. 3). In contrast to the reported
racemic B(C₆F₅)₃-catalyzed oxime hydrogenation (27), the bulky
tert-butoxy group of 1a is not mandatory. Substrates with
O-methyl as well as other primary and secondary O-alkyl
substituents were quantitatively hydrogenated with high
enantioselectivities, up to 97:3 e.r. Even free oximes were
smoothly reduced to the corresponding hydroxylamine products
without any detectable N-O bond
Entry  Solvent                   Yield (%)*   e.r. (S:R)†
1      TFE                       >99          54:46
2      MeOH                      >99          80:20
3      tAmylOH                   98 (97‡)     97.5:2.5
4      toluene                   86           87:13
5      THF                       95           86:14

Entry  Deviation from entry 3    Yield (%)*   e.r. (S:R)†
6      No (S)-Ir4                <1§          -
7      from Z-1a                 60           75:25
8      1.0 equiv. MsOH           64           98:2
9      4 h                       53           97.5:2.5
10     0.25 mol % (S)-Ir4        42           95:5
11     1 bar H₂                  24           92:8
12     with (S)-Ir4-Cl           <1           -


Fig. 2. Development of the iridium precatalyst and reaction optimization. Conditions: E-1a, 1 mol % Ir, 50 bar H₂, 1.5 equiv MsOH, 0.5 M in iPrOH, 23°C,
20 hours, basic work-up to free 2a. †Determined by chiral HPLC. ‡Isolated yield.


scission, as exemplified by 2b. Despite the acidic reaction
conditions, an acid-labile acetal moiety (2c) and O-benzyl
groups susceptible to hydrogenolysis (2d, 2e) did not
interfere and remained intact. Substrate selectivity control
by existing stereocenters at the ether oxygen atom, as
exemplified by 1e with the (S)-α-methylbenzyl group, could be
overridden by the iridium catalyst. Product 2e was obtained
either in a 4:1 d.r. for the mismatched case with (S)-Ir4 or
in a 7:93 d.r. using the matched catalyst isomer (R)-Ir4. In
general, the enantiomeric purity of products 2 could be
upgraded by simple recrystallization of their salts, as shown
for compound (S)-2g (from 93:7 e.r. to 99:1 e.r. with 81%
recovery). When starting from pure Z-1g isomer, the antipode
(R)-2g was formed in 20:80 e.r. Dialkyl oximes 1l and 1m
bearing 2° and 3° alkyl substituents were equally well
reduced, giving for instance N-2m, the N-methoxy derivative of
rimantadine (28) in 93:7 e.r. Related substrates with adjacent
hydroxyl or tosylate groups at the α-carbon atom were
compatible, affording reduced products 2n and 2o in excellent
yields and high selectivities. Another salient application is
the access to N-alkoxy amino
*Determined by ¹H-NMR with internal standard; yield of 2a virtually identical to the consumption of 1a.
§>99% of 1a recovered (4:1 E:Z).


acid derivatives, as shown for valine analog 
2p formed in 98% yield and 90:10 e.r. The
protocol works with D₂ generated by Skrydstrup's
two-chamber setup, thus qualifying
for practical deuterium isotopic labeling of 
valuable hydroxylamine scaffolds. Accord- 
ingly, benzoxazine 2q-d, was synthesized in 
80% yield and 97:3 e.r. with 92% deuterium 
incorporation. 

We next targeted the synthesis of N-alkoxy derivatives of
valuable chiral α-branched benzylamines (29). No stereocontrol
was achieved with 4-methoxy acetophenone-derived oxime E-1j,
which equilibrates to a 9:1 E:Z isomeric mixture under the
reaction conditions. Sterically more demanding Z-1k, with a
locked Z-configuration due to the tert-butyl substituent, gave
2k in 92:8 e.r. To further investigate the impact of the aryl
oxime stereochemistry on the reaction, we individually
subjected the separated isomers Z-1re and E-1re to conditions
for fast hydrogenation (3 mol % Ir4) and slow isomerization
(iPrOH, 1.0 equiv of MsOH, 2 hours) (Fig. 3B). The
diastereoisomer having the N-OR moiety trans to the large
substituent (here Z-1re) reacted much faster and more
selectively to 2re (99%, 97:3 e.r.) than did E-1re (22%, 70:30
e.r.), which suggests that actually only ~7% of isomer E-1re
was reduced. Given this behavior, we hypothesized that using
conditions with a fast E/Z equilibration regime would be most
beneficial. Indeed, hydrogenation of a 1:1 E:Z mixture of 1re
under equilibrating conditions [ethanol (EtOH), 1.5 equiv of
MsOH] formed 2re in quantitative yield and 92:8 e.r.

A variety of substrates 1ra to 1rk were
conveniently hydrogenated as the unresolved 1:1
E/Z mixtures, forming the corresponding N- 
methoxy amines in excellent yields and com- 
parably high enantioselectivities. This substrate 
type allowed the demonstration of the unique 
chemoselectivity and functional group toler- 
ance of the acid-assisted reduction method. 
Whereas most transition metal-catalyzed hy- 
drogenation methods frequently reduce aro- 
matic bromo, vinyl, and nitro as well as azido 
groups, these remarkably remained untouched 
under our reaction conditions. Moreover, the 
pinacol boronate group of 2rk survived, serving 
as a potential handle for subsequent cross- 
coupling reactions. Nitrogen and sulfur het- 
eroarenes 2re and 2rf were also compatible. 
(See fig. S6 for tests of additional functional 
groups.) Enzyme inhibitor 1rg (30) could be reduced in
quantitative yield and 93:7 e.r., illustrating the
applicability of the method for late-stage derivatization.
Existing stereogenic centers adjacent to the C=N bond exhibit
a strong substrate control. For instance, (R)-1rl possessing a
β-methyl stereogenic center yielded cis-isomer (R,R)-2rl in
97:3 d.r. and 93:7 e.r. In contrast, β-ester analog (+)-1rm
gave trans-isomer 2rm in 98:2 d.r., likely by
cis-hydrogenation followed by acid-promoted epimerization of
the ester to the thermodynamically more stable product.
N-methoxy derivatives of norephedrine 2s and estrone 2t were
formed in highly diastereoselective fashion, the former after
double hydrogenation of 1s. Finally, a 25-g batch of E/Z-1rb
was hydrogenated with a reduced 0.05 mol % catalyst loading to
(R)-2rb in a quantitative manner (turnover number 4000) in
93:7 e.r., supporting the scalability of the method. Overall,
our findings serve as a blueprint for further asymmetric
transition metal-catalyzed ionic hydrogenations of challenging
substrates, and highlight the yet untapped potential of
cyclometalated chiral Cpˣ metal complexes in catalysis.
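
Catalyst loading and turnover number (TON) are linked by a simple ratio: moles of product formed per mole of catalyst. The sketch below uses that common definition, which is an assumption here since the text does not state how TON was computed. The function names are illustrative.

```python
def turnover_number(conversion_fraction, loading_mol_percent):
    """TON = moles of product per mole of catalyst.
    conversion_fraction is 1.0 for quantitative conversion."""
    return conversion_fraction / (loading_mol_percent / 100.0)


def loading_for_ton(ton, conversion_fraction=1.0):
    """Catalyst loading (mol %) that gives the requested TON."""
    return 100.0 * conversion_fraction / ton


if __name__ == "__main__":
    print(turnover_number(1.0, 1.0))     # standard 1 mol % conditions
    print(turnover_number(0.42, 0.25))   # entry 10: 42% yield at 0.25 mol %
    print(turnover_number(1.0, 0.05))    # quantitative at 0.05 mol %
    print(loading_for_ton(4000))         # loading that corresponds to TON 4000
```

Under this definition, quantitative conversion at 0.05 mol % gives a TON of about 2000, whereas a TON of 4000 corresponds to 0.025 mol %. The text alone does not settle which of the two quoted figures the scale-up run reflects.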
Nontargeted mass-spectral detection of
chloroperfluoropolyether carboxylates in
New Jersey soils


John W. Washington, Charlita G. Rosal, James P. McCord, Mark J. Strynar, Andrew B. Lindstrom,
Erica L. Bergman, Sandra M. Goodrow, Haile K. Tadesse, Andrew N. Pilant,
Benjamin J. Washington, Mary J. Davis, Brittany G. Stuart, Thomas M. Jenkins


The toxicity and environmental persistence of anthropogenic per- and poly-fluoroalkyl
substances (PFAS) are of global concern. To address legacy PFAS concerns in the United States,
industry developed numerous replacement PFAS that commonly are treated as confidential
information. To investigate the distribution of PFAS in New Jersey, soils collected from across
the state were subjected to nontargeted mass-spectral analyses. Ten chloroperfluoropolyether
carboxylates were tentatively identified, with at least three congeners in all samples. Nine
congeners are ≥(CF₂)₇. Distinct chemical formulas and structures, as well as geographic distribution,
suggest airborne transport from an industrial source. Lighter congeners dispersed more widely
than heavier congeners, with the most widely dispersed detected in an in-stock New Hampshire
sample. Additional data were used to develop a legacy-PFAS fingerprint for historical PFAS
sources in New Jersey.


Per- and poly-fluoroalkyl substances (PFAS) are anthropogenic
compounds used to impart surfactant, antistaining,
antisticking, and related properties to a wide array of
consumer and industrial products. Spurred by concerns
regarding potential toxicity and environmental persistence of
long-chain PFAS (1-5), in 2006 the U.S. Environmental
Protection Agency (EPA) and eight leading 
PFAS manufacturers and users negotiated 
a voluntary “PFOA Stewardship Program” in 
which the companies agreed to work toward the 
elimination of perfluorooctanoic acid (PFOA,
or C8), as well as C8 precursors and related 
longer-chain homologs from emissions and 
product content by 2015. With establishment 
of the PFOA Stewardship Program, numer- 
ous PFAS manufacturers and users initiated 
efforts to develop substitute compounds for 
legacy long-chain PFAS, commonly settling on 
structures that are treated as confidential 
business information. With proliferation of 
these substitute PFAS, environmental chem- 
ists have set about attempting to identify them 
using nontargeted, high-resolution mass spec- 
trometry (HRMS) to assemble formulas and 
likely structures from molecular-precursor and 
-fragment data (6). High mass-resolution 
enables chemists to identify those molecular
formulas that have exact masses within
a user-specified mass-error threshold, and 
molecular-fragment masses and spectra of 
the molecules help narrow possible formu- 
las further, ideally informing molecular struc- 
ture as well (7). 

Fig. 1. A chloroperfluoropolyether carboxylate (ClPFPECA) identified by nontargeted MS analyses
in soil samples from New Jersey. In the New Jersey samples, perfluoroethyl (e) plus perfluoropropyl
(p) groups were observed to range in sum from one to four. The example congener depicted here would
be designated (e,p) = 1,1. Isomers likely include an alternative terminal structure of ClCF2CF(CF3)O-
(13, 14) as well as relative positions for the perfluoroethyl and perfluoropropyl groups.

Among participants in the PFOA Stewardship
Program, several have operated industrial
facilities, ongoing or in the past, in or near
densely populated New Jersey. As part of efforts
to elucidate industrial chemical sources,
chemical species, and distribution of legacy
and possible substitute PFAS in New Jersey,
in late 2017 the New Jersey Department of
Environmental Protection (NJDEP) collected
soil samples. For this survey, samples primarily
were collected in southern New Jersey, where
two PFOA Stewardship Program signatories
are located: Solvay, in West Deptford Township,
and DuPont (now Chemours), in Pennsville
Township. Historically, Solvay produced
polyvinylidene fluoride (PVDF), which entailed use
of Surflon, a surfactant that contains C9, C11, 
and C13 (perfluorononanoate, perfluorounde- 
canoate, and perfluorotridecanoate) perfluor- 
ocarboxylates (PFCAs) (8). By contrast, the 
DuPont/Chemours facility manufactured 
and used fluorotelomers [compounds syn- 
thesized from perfluoroalkyl iodide, composed 
of perfluorinated-carbon straight chains such 
as F(CF2)n-, and usually two-hydrogen-bearing
carbons, such as -CH2CH2-] from 1962 until no
later than 2014 (9). Sampling transects were 
collected in the dominant downwind directions 
as recorded at nearby Philadelphia Interna- 
tional Airport, and remote locations around 
the state were sampled as well (sampling cam- 
paign details are available in the supplemen- 
tary materials). These samples were sent to the 
EPA, Office of Research and Development (ORD) 
laboratory in Athens, Georgia. 

At the ORD laboratory, soil samples were 
extracted (supplementary text, fig. S1, and 
table S1) in triplicate and selected samples 
analyzed (supplementary text) for PFAS unknown
to our research team by using an ultra-performance
liquid chromatograph (UPLC)
coupled to a quadrupole time-of-flight (QToF)
mass spectrometer operating in negative
electrospray ionization (ESI), MS^E (no mass
filtering) mode. Output data were sorted by
signal intensity, high-intensity molecular fea- 
tures were plotted on mass-defect plots (7) 
ranging in defect from -0.10 to +0.05 Da, 
and molecular features appearing in the plots 
of multiple samples were selected for further 
scrutiny. Using low-collision-energy precur- 
sor masses, high-collision-energy fragment 
masses, a distinctive mono-chloro M+2 spec- 
tral feature, and carbon-isotopic ratios (10), 
we tentatively identified a molecular feature 
as a chloroperfluoropolyether carboxylate 
(ClPFPECA) that is described in the literature
as “Solvay’s product (CAS No. 329238-24-6)”
(11), as reported in a product assessment by
the European Food Safety Authority (EFSA)
at the request of “Solvay Solexis, Italy” (12).
With these reports, together with compound-synthesis
papers by Solvay chemists (13, 14),
the structure of these ClPFPECAs appears to
be as shown in Fig. 1 for 70% of production,
with 30% having an alternative terminus of
ClCF2CF(CF3)O-.
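
The mass-defect screening step described above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' workflow: it assumes the nominal mass is the nearest integer to the exact mass (the text states only the plotted defect window of -0.10 to +0.05 Da), and the feature list, intensities, and function names are hypothetical.

```python
def mass_defect(exact_mass: float) -> float:
    """Mass defect in Da, taken here as exact mass minus nearest-integer mass (assumption)."""
    return exact_mass - round(exact_mass)


def in_defect_window(exact_mass: float, low: float = -0.10, high: float = 0.05) -> bool:
    """True if the feature falls inside the defect window quoted in the text."""
    return low <= mass_defect(exact_mass) <= high


# Hypothetical molecular features as (m/z, signal intensity) pairs, invented for illustration.
features = [(460.9262, 8.1e5), (255.2330, 2.4e6), (626.9115, 3.0e5)]

# Sort by signal intensity (highest first), then keep features inside the defect window.
by_intensity = sorted(features, key=lambda f: f[1], reverse=True)
kept = [mz for mz, _ in by_intensity if in_defect_window(mz)]
```

Highly fluorinated ions tend to carry negative mass defects, so a window of this kind discards many hydrogen-rich matrix ions (such as the hypothetical m/z 255.2330 feature) before any formula assignment is attempted.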

We have not had access to a standard of the 
Solvay product. However, on the basis of ten- 
tative identification of one Solvay product 
congener in our data, and the literature report 
that ClPFPECA congeners can include 0 to
2 perfluoroethyl groups (Fig. 1, e) and 1 to
4 perfluoropropyl groups (Fig. 1, p) (11, 12)
separated by ether linkages, we carried out
suspect screening of our MS^E data by extracting
hypothetical masses to determine
what other congeners might be present. After
this effort, all tentatively identified congeners
were further elucidated on the QToF operat- 
ing in MS/MS mode, in which the quadrupole 
magnets were focused on suspected precursor 
mass/charge ratio (m/z) values and fragmented
with ramped collision energy; then, precursors 
and fragments were isolated and detected in 
the ToF (supplementary text). Results for the 
nine CIPFPECA congeners tentatively identi- 
fied on QToF are depicted in Fig. 2 and fig. S2. 
Within conventional HRMS-identification con- 
fidence context (15, 16), these compounds fall at 
level 2b (diagnostic probable structure) and 
level 3 (tentative candidate), but considering 
the nine congeners together, confidence of their 
general identity is high. 
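
The suspect-screening idea, generating hypothetical congener masses and matching them to observed features, might look like the following Python sketch. The anion formula ClC3F6O-(C2F4O)e-(C3F6O)p-CF2COO- is an assumption inferred from the Fig. 1 description; it is not taken from an authentic standard. As a sanity check only, it gives masses that round to the 792.9 Da (e,p = 0,3) and 858.9 Da (2,2) values quoted later in the text. The function names, the (e,p) search range, and the example m/z are hypothetical; the 4 mDa tolerance mirrors the mass error reported for the routine method.

```python
# Monoisotopic masses (Da) of the most abundant isotopes, plus the electron mass.
C, F, O, CL35, ELECTRON = 12.0, 18.998403163, 15.994914620, 34.968852682, 0.000548580


def anion_mass(e: int, p: int) -> float:
    """[M-H]- mass for the ASSUMED formula ClC3F6O-(C2F4O)e-(C3F6O)p-CF2COO-."""
    n_c = 3 + 2 * e + 3 * p + 2   # terminal C3, ethyl and propyl units, CF2COO
    n_f = 6 + 4 * e + 6 * p + 2
    n_o = 1 + e + p + 2           # one ether oxygen per unit, two carboxylate oxygens
    return CL35 + n_c * C + n_f * F + n_o * O + ELECTRON


def suspect_list(max_e: int = 4, max_p: int = 4) -> dict:
    """Hypothetical (e, p) -> m/z table for congeners with at least one e or p unit."""
    return {(e, p): anion_mass(e, p)
            for e in range(max_e + 1)
            for p in range(max_p + 1)
            if e + p >= 1}


def match(observed_mz: float, suspects: dict, tol_da: float = 0.004) -> list:
    """Return every (e, p) whose hypothetical mass lies within tol_da of the observation."""
    return [ep for ep, m in suspects.items() if abs(m - observed_mz) <= tol_da]


suspects = suspect_list()
hits = match(576.9150, suspects)  # hypothetical observed m/z, invented for illustration
```

A mass match of this kind only nominates a candidate; the fragment spectra, the mono-chloro M+2 feature, and the carbon-isotopic ratios described in the text are what raise confidence in an assignment.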

Having tentatively identified nine congeners 
in these New Jersey soil samples as Solvay’s 
product, we reexamined in-house nontargeted 
results for a water sample from the Bormida di 
Spigno River, downstream of Solvay Specialty 
Polymers Italy (Spinetta Marengo, Alessandria, 
Italy). In this Italian water sample, we identified 
five CIPFPECA congeners (fig. S3) that were con- 
sistent with our New Jersey soil samples, bolster- 
ing confidence still further in our identification 
of these compounds as Solvay’s product. 

Fig. 2. Mass chromatograms (MS/MS mode), spectra, and precursor and
fragment structures of four smaller ClPFPECA congeners detected in
New Jersey samples. These are identified in the top left of the chromatograms
by ethyl#,propyl#. Results for larger congeners are shown in fig. S2.

Informed by the fragmentation patterns of
the QToF suspect screening, we developed a
method for routine analysis of the detected
congeners on a conventional-resolution tandem
mass spectrometer [liquid chromatography
(LC)-MS/MS], adding monitoring for a
possible ethyl,propyl (e,p) = 1,0 congener (fig.
S4 and table S2). Whereas this method was
not developed with the benefit of authentic 
standards, it was informed by masses for ~30 
precursors and fragments uniformly having 
mass error <4 mDa when the MS signal is 
210° (fig. S5). With an objective of assessing 
relative concentrations among samples, we 
performed analyses on the triplicate soil extracts
with a matrix internal standard labeled
with five heavy carbons, 13C5-perfluorononanoic
acid (13C5-PFNA; 13C5-C9), then reported
ClPFPECAs “as C9,” by simple peak-area ratios
(supplementary text). We also performed LC- 
MS/MS analyses on the triplicate soil-extract 
replicates for legacy PFCAs, quantitating 
on mass-labeled internal matrix standards 
(supplementary text). Results of ClPFPECA
analyses are summarized in table S4, and
PFCA analyses are summarized in table S5.
Of the 10 congeners we identified by means
of QToF or tandem MS, (i) six were expected
on the basis of EFSA information (e,p = 0,1; 1,1;
0,2; 2,1; 1,2; and 0,3 congeners) (11, 12); (ii)
four were not included as congeners in the
EFSA information (1,0; 2,0; 3,0; and 4,0 congeners);
and (iii) six congeners anticipated on
the basis of EFSA information were not detected
(2,2; 1,3; 2,3; 0,4; 1,4; and 2,4 congeners) (fig. S6).
In fig. S7, we summarize the fractional com- 
position of the 10 CIPFPECA congeners detected 
in our study in terms of mean, maximum, and 
minimum fraction observed among our soil 
samples. Addressing the mean fractions, at 
roughly 40% each, the e,p = 0,1 and 1,1 con- 
geners are dominant, followed by ~15% for the 
0,2 and lesser to trace amounts of all other 
congeners (fig. S7). 

Several CIPFPECAs eluted as split peaks 
(Fig. 2 and fig. S2). We investigated whether 
this splitting reflected the presence of isomers 
by extracting spectral patterns of visually dis- 
tinct chromatographic peak ranges, looking 
for unique fragmentation patterns across ag- 
gregate peaks (supplementary text, qualitative 
examination for isomers, and figs. S8 to S10). 
On the basis of these efforts, we suspect the 
presence of group-regioisomerism for con- 
geners having both ethyl and propyl groups as 
well as regioisomers based on chlorine position 
(Fig. 1). 

Chromatogram peaks consist of signal from precursors and selected
major fragments. Congeners elute in order according to molecular mass,
small to large. On major spectra, the diagnostic monochlorine signal is

These New Jersey soil samples generally
were elevated in legacy PFCAs relative to global
background soil estimates (17) and particularly
elevated in C9 and longer homologs. For example,
the mean C9 in our New Jersey soils is
785 pg/g dry soil (table S5) [compared with
global background of 18 pg/g (17)]; mean
C10 = 437 pg/g (perfluorodecanoate; background
= 11 pg/g); mean C11 = 1618 pg/g
(background = 9.6 pg/g); mean C12 = 167 pg/g
(perfluorododecanoate; background = 9.0 pg/g);
and mean C13 = 222 pg/g (background not re- 
ported). Also, the lowest New Jersey soil 
concentrations in our study for C9 through 
C12 PFCAs (table S5) were 5- to 30-fold that 
of mean global background values (17). These 
increased long-chain concentrations resulted 
in an anomalous PFCA-homolog profile for 
the New Jersey samples relative to global back- 
ground. Whereas the PFCA profile for global 
background soils tended to be highest in C6, 
C7, and C8 PFCAs (perfluorohexanoate, per- 
fluoroheptanoate, and perfluorooctanoate), 
in this order, these New Jersey samples were 
most highly represented by C11 and C9, in this 
order (fig. S11). 
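
As a worked check of the comparison above, the ratio of each New Jersey mean to its global background value can be computed directly. The concentrations are the ones quoted in the text (pg/g dry soil); C13 is omitted because no background value is reported. The variable names are illustrative only.

```python
# Mean New Jersey soil concentrations and global background values quoted in the text (pg/g).
nj_mean = {"C9": 785.0, "C10": 437.0, "C11": 1618.0, "C12": 167.0}
background = {"C9": 18.0, "C10": 11.0, "C11": 9.6, "C12": 9.0}

# Fold elevation of the New Jersey mean over global background for each homolog.
fold = {homolog: nj_mean[homolog] / background[homolog] for homolog in nj_mean}

# Homologs ordered from most to least elevated relative to background.
ranked = sorted(fold, key=fold.get, reverse=True)
```

On these means the elevation is largest for C11, at well over 100-fold, and is roughly 20- to 45-fold for the other three homologs.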

Taken altogether, these data for CIPFPECAs 
and the elevated levels of legacy PFAS strongly 
suggest the presence of regional PFAS sources. 

Fig. 3. Concentration profile. (A) Log 0,1-congener
soil concentration (picograms/gram) versus log
distance from Solvay (kilometers). The regression
statistics are for the New Jersey soil samples
(blue) located as far as 150 km removed from Solvay
(table S1). Other ClPFPECA congeners are still more
highly correlated with distance from Solvay (Table 1).
Also shown is the 0,1 congener detected in a soil from
Merrimack, New Hampshire, at 12.1 pg/g (orange),
some 460 km distant from Solvay (table S1), falling
closely proximate to the regression line for New Jersey
0,1 congeners. The 0,1 congener is the most widely
dispersed of the ClPFPECAs (B) and the only
ClPFPECA detected in the New Hampshire soil.
Inclusion of the New Hampshire data point in the
regression [coefficient of determination (R²) = 0.55;
P = 10°°] increases the significance of the relationship
roughly an order of magnitude beyond that of
New Jersey data alone. (B) Regression slope
(log [ClPFPECA] versus log distance from Solvay) for
each of eight ClPFPECA congeners versus congener
molecular mass. Given the statistically significant
relationship (P = 0.001), this observation suggests
sorting by molecular mass in an atmospheric plume,
with lighter molecules generally being dispersed
more remotely than heavier molecules. Mechanisms
of atmospheric mass sorting remain uncertain,
but the regression slope also is correlated with
congener-acid vapor pressure (R² = 0.91; P < 0.001)
and congener-anion octanol-water partition coefficient
(R² = 0.92; P < 0.001), as estimated by the EPA
Chemical Transformation Simulator (25).

Probing for possible relationships suggested
by variation in the data, we performed principal
component analysis (PCA) to guide directed
testing (fig. S12). Principal component
1 (PC1) and PC2 account for 96.8% of variation
in the data, with PC1 alone accounting for 
90.6%. The 95% confidence interval ellipsoids 
in the PCA score plot (fig. S12) encompass the 
two chemical families almost exclusively: the 
CIPFPECAs and the legacy PFCAs. The major 
ellipsoidal axis of the CIPFPECA cluster is ori- 
ented more closely parallel to PC1, reflecting 
considerable variance among these data that 
can be characterized dominantly by a single 
component, as might be expected for a single 
physical source. Additionally, C11 and C13 fall 
within the ClPFPECA ellipsoid (fig. S12), suggesting
similarities in the pattern of variation for C11
and C13 with at least some of the ClPFPECAs.

Exploring variation in the CIPFPECA data 
(fig. S12), we regressed the eight CIPFPECA 
congeners detected in most samples (exclud- 
ing rarely detected 1,0 and 4,0 congeners) 
against distance from Solvay in log-transformed 
space (Fig. 3A). All eight congeners decreased 
with distance from Solvay with high degrees 
of significance (P < 0.0002) (Table 1). Examining 
the data in three dimensions, the CIPFPECA 
concentration contours form a concentric focus 
on Solvay, which is consistent with Solvay being 
the source of these compounds (Fig. 4). The 
slope of diminishing concentration with distance
from Solvay (Table 1) also increases with
molecular mass (P < 0.001) (Fig. 3B), suggesting
that smaller congeners were dispersed more
widely than larger congeners. This sorting by
mass might be a factor in the absence of our 
detection of several of the largest CIPFPECA 
congeners expected for the Solvay product (fig. 
S6) (12); the heaviest congener we detected is 
the e,p = 0,3 at 792.9 Da, and the lightest of the 
six congeners expected, but not detected (fig. 
S6), was the 2,2, with a mass of 858.9 Da. 
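
The regression in log-transformed space can be illustrated with an ordinary least-squares fit of log10 concentration against log10 distance. This is a minimal sketch under stated assumptions: the function name is hypothetical, the data are synthetic values generated from an exact power law (not measurements from this study), and the handling of non-detects is not addressed.

```python
import math


def loglog_fit(distance_km, conc_pg_g):
    """Least-squares fit of log10(concentration) on log10(distance).

    Returns (slope, intercept, pearson_r). All inputs must be positive.
    """
    x = [math.log10(d) for d in distance_km]
    y = [math.log10(c) for c in conc_pg_g]
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    pearson_r = sxy / math.sqrt(sxx * syy)
    return slope, intercept, pearson_r


# Synthetic example: concentrations following an exact power law, c = 2000 * d**-0.662.
distances = [1.0, 2.0, 5.0, 10.0, 50.0, 150.0]
concentrations = [2000.0 * d ** -0.662 for d in distances]
slope, intercept, r = loglog_fit(distances, concentrations)
```

A more negative slope means concentration falls off faster with distance, which is how the comparison of slopes across congeners of different molecular mass in Fig. 3B should be read.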

Considering that these soil samples chiefly 
are from positions that are not hydraulically 
downgradient in the watershed of any Solvay 
wastewater discharge (Fig. 4 and fig. S1), aqueous 
discharge cannot explain these observations, 
so these correlations strongly suggest atmo- 
spheric release from Solvay as the principal 
mode of occurrence for these soils. 

The observation that three of the lightest 
congeners (0,1; 1,1; and 0,2) were detected in 
all study samples, including the most remote 
New Jersey sample near the northern state 
border (sample SS22) (fig. S1), suggests that 
light congeners might be dispersed beyond 
New Jersey state boundaries. To explore this 
possibility, we analyzed an in-stock sample 
from Merrimack, New Hampshire, that falls 
roughly parallel with the downwind transect 
extending northeasterly from Solvay (fig. S13). 
To determine whether unrelated samples might 
have CIPFPECAs, we also analyzed an in-stock 
Table 1. Regression statistics for chemical data (picograms/gram) against distance from selected facilities in log-transformed space. ND, not
detected; n, sample count; Nonsig., nonsignificant; PFUA, perfluoroundecanoic acid; PFDA, perfluorodecanoic acid; PFDoA, perfluorododecanoic acid; PFTeA,
perfluorotetradecanoic acid.


Analyte 
atmospheric 
precursor* 


Compound(s) 
D =0,n = 24) 


Distance from Solvay (km) 
(maximum n = 24) 


Pearson R Pr 
0.688 2.0 x10 


Slope 
-0.662 


Compound(s) 


Distance from Chemours (km) 
(anomalous background SS22 
excluded; n = 23) 


Pearson R Py 


12 (ND =7, = 17) 


0.766 B25 Ore OSI 


0.846 1.9 x 105 


~1.662 


0.796 BS SOS — =O a7/ 


0.482 LAO O26 


Ball 8 IO? 


0.519 47x103  -0.324 


Nonsig. 


(C11 + C13) - (C10 + C12)


*Source, Ellis et al. (20). †Significance level.


sample from Conyers, Georgia, which is roughly 
1000 km southwest from Solvay (fig. S13). We 
detected the 0,1 congener in the downwind 
Merrimack sample (Fig. 3A) and no other con- 
geners, and we detected no CIPFPECAs in the 
remote Conyers sample. The 0,1 congener is the 
most widely dispersed (Fig. 3B and Table 1), 
and the New Hampshire sample, some 450 km 
removed, plots closely proximate to the re- 
gression line for the 0,1 congener in New Jersey 
samples as a function of distance to Solvay (Fig. 
3A). However, whether this New Hampshire 
0,1-congener detection is from Solvay or some 
unknown source requires more study. 

Given the role of Solvay as potentially the 
dominant or sole source of CIPFPECAs in our 
study, plots of legacy PFCAs against CIPFPECAs 
potentially guide which, if any, legacy PFCAs 
remain diagnostic of pre-Stewardship Solvay 
releases. Plotting concentrations of each legacy 
PFCA, chain lengths C4 (perfluorobutanoic 
acid) through C13 [perfluorotridecanoic acid 
(PFTrA)], against the sum of CIPFPECAs in 
fig. S14 shows three samples from closely 
proximate to Solvay that are high in CIPFPECAs 
also are high in C9, C11, and C13 PFCAs. On the 
basis of this observation, C9, C11, and C13 were 
regressed against distance from Solvay. Results 
of these regressions indicated that C9 is not 
correlated with distance from Solvay, but con- 
sistent with the PCA (fig. S12), C11 (P = 1.2 x 
107°) and C13 (P = 1.7 x 10°”) were statistically 
related with distance from Solvay (Table 1 and 
0.660 4.5 x 10^-4


fig. S15). The seeming inconsistency of C9 
plotting anomalously in fig. S14 but not 
being statistically related to distance from 
Solvay is likely due in large part to the rela- 
tively much higher mobility of C9 than C11 and 
C13 in soils. For example, in a study of PFCAs 
in Decatur, Alabama, soils, Washington et al. 
(8) reported deep-to-surface soil ratios for C9 
as high as 50-fold that of C11 or C13, suggest- 
ing much higher rates of loss for C9 than C11 
and C13 from surface soils through leaching 
and percolation. 

Although figs. S14 and S15 and Table 1 suggest 
a relationship of C11 and C13 with Solvay, con- 
siderable spread remains in the data (fig. S15), 
perhaps reflecting noise imparted from other 
sources. The majority of all environmental 
releases of PFCAs longer than C8 from 1951 to 
2015 arose from fluorotelomer- and C9-based 
products (19). According to smog-chamber experiments
(20) and global-scale modeling that
used a complex suite of kinetic constants estimated
from literature (21), atmospheric oxidation
of n:2FTOHs (where n is an even integer
and FTOHs are fluorotelomer alcohols) yields
roughly equimolar nPFCAs and (n+1)PFCAs or 
preferentially nPFCAs in urban areas where 
nitrogen oxides can be elevated. In soils, micro- 
bially mediated degradation of n:2FTOHs has 
been shown to proceed through beta oxidation 
to yield dominantly nPFCAs (22, 23). Consistent
with these studies, in their global soil
survey, Rankin et al. (24) reported that C8/C9
[nPFCA/(n+1)PFCA] ratios commonly fall in
roughly equimolar to dominantly C8 (nPFCA)
range and argued atmospheric or soil degradation
of fluorotelomers as a dominant mode of
PFCAs occurrence globally. Given (i) historical
production and use of fluorotelomers at
the large-scale New Jersey Chemours facility,
(ii) the generally prevalent contribution of
fluorotelomers to C10 and C12, and (iii) atmospheric
and soil fluorotelomer-degradation stoichiometry
favoring roughly equimolar or dominantly
even-chain PFCAs, the difference of (n+1)PFCAs
minus nPFCAs, (C11 + C13) - (C10 + C12),
has the potential to deconvolute potential signals
from Solvay and Chemours for these legacy
PFAS. Large positive excesses in this difference
suggest direct release of C11 and C13 PFCAs,
whereas near-zero or negative values of this
difference would be consistent with atmospheric
or soil degradation of fluorotelomer precursors
as a source.
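
The difference (C11 + C13) - (C10 + C12) is simple to compute per sample, as in the sketch below. The first example uses the study-wide mean concentrations quoted earlier in the text; the second sample is hypothetical and invented only to show a negative value. The function name is illustrative.

```python
def odd_minus_even(c10: float, c11: float, c12: float, c13: float) -> float:
    """Return (C11 + C13) - (C10 + C12) for one sample, all values in pg/g."""
    return (c11 + c13) - (c10 + c12)


# Study-wide mean concentrations quoted in the text (pg/g dry soil).
mean_excess = odd_minus_even(c10=437.0, c11=1618.0, c12=167.0, c13=222.0)

# Hypothetical sample dominated by even-chain PFCAs, invented for illustration.
telomer_like = odd_minus_even(c10=50.0, c11=20.0, c12=30.0, c13=10.0)
```

On the reasoning above, a large positive value such as the one obtained from the study means points toward direct odd-chain release, whereas a near-zero or negative value is what fluorotelomer degradation would be expected to produce.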

Applying the difference (C11 + C13) - (C10 + 
C12) to our New Jersey soil data accentuates 
signal to noise in that the strength of correlation
with distance from Solvay (fig. S16) increases
nearly an order of magnitude beyond
that of C11 or C13 alone, with P = 4.5 x 10^-4
(Table 1). (C11 + C13) - (C10 + C12) is plotted in
fig. S17 as a function of the sum of ClPFPECAs,
illustrating a relationship significant at P =
4.0 x 10° and bolstering that these parameters
reflect a common mode of occurrence:
airborne transport. 




Fig. 4. Geographic distribution. Shown are ΣClPFPECAs in surface soils (picograms/gram). Contour
lines were generated by using an algorithm in ArcMap 10.6.1 that weighted the five nearest data points
according to inverse-square distance. Despite some geographic sporadicity in the data and numerical
artifacts where data are sparsely spaced, taken as a group the contours depict a clear pattern of increasing
ΣClPFPECAs with proximity to Solvay.
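
The interpolation named in the caption, weighting the five nearest data points by inverse-square distance, can be sketched generically as follows. This is not the ArcMap implementation: the function name is hypothetical, coordinates are assumed to be planar (for example, projected kilometers), and the sample values are invented for illustration.

```python
import math


def idw_estimate(x, y, samples, k=5, power=2.0):
    """Inverse-distance-weighted estimate at (x, y) from the k nearest samples.

    samples is a list of (sample_x, sample_y, value) tuples in planar coordinates.
    """
    by_distance = sorted((math.hypot(x - sx, y - sy), value) for sx, sy, value in samples)
    nearest = by_distance[:k]
    if nearest[0][0] == 0.0:
        # The query point coincides with a sample, so return its value directly.
        return nearest[0][1]
    weights = [1.0 / distance ** power for distance, _ in nearest]
    weighted_sum = sum(w * value for w, (_, value) in zip(weights, nearest))
    return weighted_sum / sum(weights)


# Hypothetical samples as (x_km, y_km, concentration in pg/g), invented for illustration.
samples = [(0.0, 0.0, 100.0), (2.0, 0.0, 300.0), (10.0, 10.0, 5.0)]
midpoint = idw_estimate(1.0, 0.0, samples, k=2)   # equidistant from the first two samples
at_sample = idw_estimate(0.0, 0.0, samples)
```

Because each estimate depends on only a few neighbors, sparsely sampled areas can show the kind of numerical artifacts the caption mentions.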


Contours of the difference (C11 + C13) - (C10 + 
C12) are mapped in fig. S18. The resulting pat- 
tern depicts a strongly expressed positive anom- 
aly focusing on Solvay as well as a negative 
anomaly proximate to Chemours, a pattern that 
is consistent with the reasoning above (fig. S18). 
These results are consistent with values reported
in Rankin et al. (24) in that three of four
samples collected ~20 km southeast of Chemours 
calculate to negative values for the difference 
(C11 + C13) - (C10 + C12). Taken altogether 
then, the difference (C11 + C13) - (C10 + C12) 
evidently fingerprints two potential PFAS 
sources in concert by accentuating differences 
in mode of occurrence: direct odd-chain PFCA 
release from the Solvay facility versus fluoro- 
telomer degradation in the atmosphere or soil 
from the Chemours facility. 
Here, we have reported tentative identification
of 10 ClPFPECA congeners distributed
across an expansive breadth of soils in densely 
populated New Jersey and likely beyond. In 
light of these findings, numerous near-term 
pressing uncertainties merit investigation, in- 
cluding the presence and mobility of the con- 
geners in soil profiles, in surface and ground 
waters, in vegetation (such as agricultural crops), 
and in animals including humans, as well as 
whether there is evidence that these CIPFPECAs 
degrade in the environment. In the longer term, 
investigation of whether these CIPFPECAs might 
be toxic is prudent. 

Restoring light sensitivity using tunable near-infrared sensors


Dasha Nelidova, Rei K. Morikawa, Cameron S. Cowan, Zoltan Raics, David Goldblum,
Hendrik P. N. Scholl, Tamas Szikra, Arnold Szabo, Daniel Hillier, Botond Roska


Enabling near-infrared light sensitivity in a blind human retina may supplement or restore visual function 
in patients with regional retinal degeneration. We induced near-infrared light sensitivity using gold 
nanorods bound to temperature-sensitive engineered transient receptor potential (TRP) channels. We 
expressed mammalian or snake TRP channels in light-insensitive retinal cones in a mouse model of 
retinal degeneration. Near-infrared stimulation increased activity in cones, ganglion cell layer neurons, 
and cortical neurons, and enabled mice to perform a learned light-driven behavior. We tuned responses 
to different wavelengths, by using nanorods of different lengths, and to different radiant powers, by 
using engineered channels with different temperature thresholds. We targeted TRP channels to human 
retinas, which allowed the postmortem activation of different cell types by near-infrared light. 


Photoreceptor degeneration, including
age-related macular degeneration and
retinitis pigmentosa, is the leading cause
of blindness in industrialized countries.
When cone photoreceptors lose light
sensitivity, high-resolution vision is affected, 
and it becomes difficult to carry out the ac- 
tivities of daily living. In most cases, photo- 
receptor degeneration is incomplete, leading 
to the presence of light-sensitive and light- 
insensitive photoreceptor zones next to each 
other within the same retina. Remaining light- 
sensitive regions limit the utility of optogenetic
(1) or light-switch (2) therapies, because
these technologies require bright, visible light 
that saturates photoreceptors. 

Enabling the detection of near-infrared 
(NIR) light (>900 nm) at wavelengths outside 
the spectrum visible to the human eye (390 to 
700 nm) could provide a way of supplement- 
ing or restoring light sensitivity in the affected 
retinal region without interfering with the vi- 
sion that remains. Currently, there is no tech- 
nology that would allow the induction of NIR 
sensitivity in a blind retina. 

A few species, such as boas, pythons, and pit 
vipers, can detect infrared light (1 to 30 µm)
using temperature-sensitive transient receptor 
potential (TRP) cation channels expressed in 
a specialized organ (3). Thermal and visual 
images superimpose in the snake’s brain (4), 
presumably enabling the snake to react to the 
environment with greater precision than what 
is possible using only a single image. TRP channels
could potentially be targeted to retinal cell
types to make them sensitive to infrared ra- 
diation. However, heat transfer to ectopically 
expressed TRP channels via direct NIR illumi- 
nation is inefficient, requiring high intensities 
that would damage the retina. 

To develop a more efficient NIR light de- 
tector for retinal cell types, we engineered a 
dual system consisting of a genetic component 
and a nanomaterial component (Fig. 1A). The 
genetic component consisted of temperature- 
sensitive TRP channels, engineered to incor- 
porate an extracellular epitope recognizable 
by a specific antibody (5). The nanomaterial 
component consisted of gold nanorods conju- 
gated to an antibody against the epitope (6). 
This system uses surface plasmon resonance 
for heat transfer (7): Gold nanorods capture 
NIR light at their resonant wavelength and 
produce heat, which is harnessed to open TRP 
channels in the proximity of the nanorods. The 
epitope ensures nanorod binding to engi- 
neered rather than native TRP channels, be- 
cause some TRP channels are expressed in 
the retina (8, 9). 

We developed a system based on rat TRP 
family V member 1 (TRPV1) channels and 
gold nanorods with absorption maxima (λabs)
at 915 nm; this value was selected to ensure 
low water absorption. We inserted a 6x-His 
epitope tag in the middle of the first TRPV1 
extracellular loop (Fig. 1, C and D), after amino 
acid 459 or 465 (fig. S1). Analysis of TRPV1 
structure suggested that insertion at these 
sites would not disrupt protein function. 

To measure whether tagged channels were 
functional, we performed whole-cell voltage 
clamp in human embryonic kidney (HEK) 
cells expressing TRPV1.459-6x-His, TRPV1.465- 
6x-His, or untagged TRPV1 while activating 
the channels by TRPV1 agonist capsaicin. The 
sizes of evoked currents were similar between 
TRPV1.465-6x-His and TRPV1 (table S1, row A)
but smaller in TRPV1.459-6x-His (table S1, row
B, and fig. S2). Therefore, we used TRPV1.465- 
6x-His (abbreviated as rTRPV1) in subsequent 
experiments. 

We targeted rTRPV1 to cone photoreceptors
of Pde6b^rd1 mice (known as rd1 mice) through
subretinal injection of adeno-associated virus 
(AAV), using a photoreceptor-specific mouse 
cone arrestin promoter (mCar) to restrict ex- 
pression (Fig. 1E). Rd1 mice have severe photo- 
receptor degeneration, with complete loss of 
rods and dysfunctional, light-insensitive cone 
photoreceptors by 4 weeks of age (Fig. 1B) 
(10). Cell membrane expression of rTRPV1 was 
seen in 55 + 10% of rd1 cones (table S1, row C, 
and Fig. 1F). Among rTRPV1-positive cells, 
98 + 1.6% were cones (table S1, row D); these 
rTRPV1-positive cones expressed the 6x-His 
tag (table S1, row E, and Fig. 1, E and F). 

To measure whether NIR light drives responses
in rd1 retinas, we performed two-photon
calcium imaging of individual cone cell
bodies and axon terminals as well as ganglion
cell bodies in wholemount P56-P72 retinas
under two conditions: (i) rTRPV1 with nanorods
(λabs = 915 nm) and (ii) rTRPV1 without
nanorods. To measure whether NIR light affects 
normal cones, we performed two-photon cal- 
cium imaging of cone axon terminals in whole- 
mount wild-type retinas stimulating cones 
with NIR (915 nm) and/or visible light. To de- 
tect calcium signals in cones, we genetically 
targeted the calcium indicator GCaMP6s via 
an AAV that expresses GCaMP6s under a cone-specific
promoter (11). For ganglion cells, we
used the organic calcium sensor Oregon Green 
488 BAPTA-1 (OGB-1). 

rTRPV1-expressing rd1 cones showed 915-nm 
light (“NIR light”)-evoked increases of calcium 
signal in the presence of nanorods (“NIR cone 
response”) (Fig. 1, G to I). NIR cone response 
was of opposite polarity compared with the 
visible light response of wild-type cones (Fig. 
1G). Polarity reversal was due to cation selec- 
tivity of rTRPV1. NIR cone response was similar 
in size to the visible light response of wild- 
type cones (table S1, rows F and G). NIR light 
neither activated wild-type cones nor affected 
their visible light responses (fig. S3). rTRPV1-expressing
cones without nanorods did not
react to light (Fig. 1, G to I). In the presence of 
nanorods, NIR light also induced responses 
in neurons of the ganglion cell layer (GCL) (fig. 
S4). In all subsequent experiments, we used 
both the TRP channel and the nanorod com- 
ponent (“NIR sensor”); uninjected rd1 mice 
were used as the control. 

To assess whether NIR light-induced retinal 
activity propagates to higher visual centers, we 
generated rd1 mice with targeted GCaMP6s
expression in layer 4 of the primary visual 
cortex (V1). Layer 4 receives feedforward con- 
nections from the lateral geniculate nucleus. 
We performed two-photon calcium imaging 

Fig. 1. NIR light responses in mouse cone photoreceptors. (A) Components of
the NIR light sensor. Engineered TRP channels (blue) express protein epitope tags
(orange) in extracellular domains and bind antibody-conjugated gold nanorods.
(B) Left: Healthy retina. Photons are captured by outer segments (OS)
of photoreceptor cells. Right: Retinal degeneration, characterized by loss of OS
and blindness. In the rd1 mouse model of degeneration, rod cell bodies are lost but
cone cell bodies persist. ONL, outer nuclear layer; INL, inner nuclear layer; GCL,
ganglion cell layer. (C and D) Structure of rTRPV1 channel. (C) Top view. Orange
arrows indicate a 6x-His epitope tag in each of the four subunits of the TRPV1 tetramer
(yellow, blue, gray, and green). The red asterisk marks the channel pore. (D) Right:
Side view. Left: 6x-His epitope tag (orange) within the first extracellular loop of the
blue subunit is enlarged. (E) Top row: Top views of rd1 retinas transduced with
both rTRPV1 and nanorods and immunostained for TRPV1 (left, blue), 6x-His
(middle, orange), and merging the two (right). Gray, Hoechst nuclear stain. Bottom
row: Cross sections of the retinas shown in the top row. Scale bars, 25 μm.
(F) Number of rTRPV1-positive (blue) and 6x-His-positive (orange) cones per square
millimeter in rd1 retinas transduced with both rTRPV1 and nanorods (n = 5 mice) or in
control, uninjected rd1 retinas (n = 5 mice). Dashed arrow indicates maximum cone
density in rd1 mice at postnatal day 70 (10). Each data point is collected from a
different region of a retina (three regions per retina). (G) Example calcium responses
(mean ΔF/F, where F represents baseline fluorescence; two to three repetitions)
recorded from cone axon terminals in P56-P71 rd1 mice transduced with rTRPV1
and nanorods (left, nanorod λ_abs = 915 nm), in rd1 mice transduced with rTRPV1 only
(middle), or in wild-type mice (right). TRP, n = 4 mice; wild-type, n = 3 mice.
Stimulus, full-field NIR light (915 nm, log10 light intensity = 18.9; left and middle) or
visible light (405 nm, log10 light intensity = 14; right). Black bars (2 s) and arrows
(100 ms) indicate stimulus timing. Two-photon images of GCaMP6s-expressing cone
axon terminals (white circles) are shown to the left of the response curves. Scale bar,
5 μm. White asterisks indicate cell bodies. (H) Cumulative frequency of responding
rTRPV1-transduced rd1 cones with (black) and without (gray) nanorods (λ_abs = 915 nm).
(I) Cone response amplitudes (ΔF/F). Light intensities as in (G). Number of cones
given as a ratio of responding cones to measured cones.

Fig. 2. NIR light responses in mouse primary visual cortex. (A) Schematic of
the experiments in the primary visual cortex (V1). Cortical neuron calcium
responses to 100-ms full-field NIR light stimulation of the contralateral eye
were recorded in P51-P83 rd1 mice. (B) Example calcium responses (mean
ΔF/F, five repetitions) to 915-nm light stimulation recorded in rd1 mice
transduced with rTRPV1 and nanorods (left, λ_abs = 915 nm, n = 3 mice) and in
control, uninjected rd1 mice (right, n = 5 mice). Vertical lines at bottom
indicate stimulus timing. Two-photon images of GCaMP6s-expressing
neuronal cell bodies in layer 4 of V1 (white circles) are shown to the left of
the response curves. Scale bar, 25 μm. (C) Cumulative frequency of
responding cortical neurons in mice transduced with both rTRPV1 and
nanorods (black, λ_abs = 915 nm) and in control, uninjected mice (gray).
(D) Cortical neuron response amplitudes (ΔF/F). Log10 light intensity = 18.3.
Number of neurons given as ratio of responding neurons to measured
neurons. (E) Morphology of nanorods tuned to 915 nm (left) and 980 nm (right),
measured by transmission electron microscopy. (F) Schematic showing
nanorod absorption spectra relative to visual pigment of the human
retina. (G) Example cortical calcium responses (mean ΔF/F, five repetitions)
to 915-nm and 980-nm light stimulation (log10 light intensity = 18.3).
Left: Mice transduced with rTRPV1 and nanorods with λ_abs = 915 nm
(n = 3 mice). Right: Mice transduced with rTRPV1 and nanorods with
λ_abs = 980 nm (n = 4 mice). Arrows indicate stimulus timing. Two-photon
images of GCaMP6s-expressing neuronal cell bodies in layer 4 of V1 (white
circles) are shown to the left of the response curves. Scale bar, 25 μm.
(H) Frequency of cortical neurons as a function of the wavelength preference
index (WPI) in mice transduced with rTRPV1 and nanorods with λ_abs = 915 nm
(orange) or nanorods with λ_abs = 980 nm (gray). For nanorods with
λ_abs = 915 nm, a larger fraction of neurons prefers 915-nm light (WPI < 0)
over 980-nm light (WPI > 0). For nanorods with λ_abs = 980 nm, a larger
fraction of neurons prefers 980-nm light (WPI > 0) over 915-nm light
(WPI < 0). Light intensities as in (G).

Fig. 3. NIR light-guided mouse behavior. (A) Schematic showing location of
TRP channel-expressing, infrared-sensitive pit organ. Information is overlaid in the
optic tectum. (B) Expression of sTRPA1 in cones of rd1 mice. Top: Top view of a
retina transduced with both sTRPA1 and nanorods, immunostained for OLLAS
(orange) and overlaid with Hoechst nuclear stain (gray). Bottom: Cross section of the
retina shown in top image. Scale bars, 25 μm. (C) Number of OLLAS-positive
(orange) cones per square millimeter in rd1 retinas transduced with both sTRPA1
and nanorods (n = 5 mice) or in control, uninjected rd1 retinas (n = 5 mice).
Dashed arrow indicates maximum cone density in rd1 mice at postnatal day 70 (10).
Each data point is collected from a different region of a retina (three regions per
retina). (D) Schematic of behavioral task. NIR full-field stimulation of one eye
(915 nm, 200 ms) cues water presentation for head-fixed, water-restricted P56-P73
rd1 animals. Mice respond by licking before (anticipation) or after the appearance
of water. (E) Lick response heatmaps. Rows: Responses of different mice. Columns:
Responses in 100-ms time bins. Top: rd1 mice transduced with both sTRPA1 and
nanorods (λ_abs = 915 nm, n = 9 mice). Middle: rd1 mice transduced with both rTRPV1
and nanorods (λ_abs = 915 nm, n = 9 mice). Bottom: Control, uninjected rd1 mice
(n = 10 mice). Left: Stimulus log10 light intensity = 17.9. Right: Stimulus log10 light
intensity = 18.3. (F) Mean anticipatory lick rates quantified from (E) as a function of
light intensity. Error bars, SEM.


in vivo in P51-P83 animals, recording layer 4 
activity at single-cell resolution during NIR 
light stimulation of the eye (Fig. 2A). In NIR 
sensor-injected animals, cortical neurons 
showed NIR light-evoked increases in calcium 
signal (Fig. 2B), which were light-intensity de- 
pendent (Fig. 2C and fig. S5). Neuronal activa- 
tion was greater in NIR sensor-injected animals 
than in controls (table S1, row H, and Fig. 2C). 
Nanorod absorption spectra can be wavelength- 
tuned by varying nanorod aspect ratios (length- 
to-width ratios) (Fig. 2E and fig. S6). To test 
whether the action spectra of neuronal ac- 
tivity can also be tuned, we selected a second 
type of gold nanorod with peak absorption at 
980 nm (aspect ratio: 5.5) and compared with 
nanorods with peak absorption at 915 nm 
(aspect ratio: 4.0). Both types were paired to
rTRPV1. For each nanorod type, we performed
layer 4 cortical calcium imaging in P51-P71 rd1 
mice twice: once with 980-nm stimulation of 
the eye, and once with 915-nm stimulation. To 
classify cortical neurons as 980 nm or 915 nm 
responsive, we computed a wavelength prefer- 
ence index (WPI) for each NIR light-responsive 
neuron. We found a preference for 980-nm 
light over 915-nm light using nanorods tuned 
to 980 nm (table S1, row I; Fig. 2H; and fig. S7).
Similarly, in animals with nanorods tuned 
to 915 nm, more cortical neurons preferred 
915-nm light over 980-nm light (table S1, row 
J; Fig. 2H; and fig. S7). 
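
The text does not spell out how the WPI is computed, so the following is only a minimal sketch under an assumed definition: a normalized difference of the responses to the two wavelengths. That form is bounded between -1 and 1 and is negative for 915-nm preferring neurons, which matches the axis range and sign convention of Fig. 2H, but the formula itself and the function names are illustrative assumptions, not the authors' method.

```python
def wavelength_preference_index(resp_915: float, resp_980: float) -> float:
    """Assumed WPI: (R980 - R915) / (R980 + R915), bounded in [-1, 1].

    resp_915 and resp_980 are hypothetical non-negative response
    amplitudes (e.g. peak dF/F) of one neuron to each wavelength.
    """
    if resp_915 < 0 or resp_980 < 0:
        raise ValueError("response amplitudes must be non-negative")
    total = resp_915 + resp_980
    if total == 0:
        raise ValueError("neuron did not respond to either wavelength")
    return (resp_980 - resp_915) / total


def classify(wpi: float) -> str:
    """Sign convention from Fig. 2H: WPI < 0 is 915 nm, WPI > 0 is 980 nm."""
    if wpi > 0:
        return "980 nm preferring"
    if wpi < 0:
        return "915 nm preferring"
    return "no preference"
```

Under this assumed definition, a neuron that responds only to 915-nm light sits at WPI = -1 and one that responds only to 980-nm light sits at WPI = 1.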

Next, we asked whether molecular com- 
ponents can be tuned to increase sensitivity. 
A variety of TRPA1 channels also serve as
heat sensors. TRPA1 from the Texas rat snake
(Elaphe obsoleta lindheimeri) is activated at a
lower temperature than rTRPV1 (3). To deter-
mine the suitability of snake TRPA1 as a NIR
sensor component, we first engineered TRPA1
to express the peptide epitope tag OLLAS
(12) within the first or second extracellular
loop. Anti-OLLAS antibodies show improved
immunodetection compared with anti-6x-
His and other antibodies for conventional
epitope tags (12). To identify loop domains,
we first determined the location of extracel-
lular loop domains of human TRPA1 from
its cryo-electron microscopy structure (13).
Subsequently, we identified potential loop do-
mains of snake TRPA1 after pairwise sequence
Fig. 4. NIR light responses in the ex vivo human retina. (A) Schematic of a
human retinal explant 8 weeks postmortem. Long-term culture leads to loss of
outer segments and light responses. (B) Top: Top view of a human retina
transduced with both rTRPV1 and nanorods, immunostained for TRPV1 (blue)
and overlaid with Hoechst nuclear stain (gray). Bottom: Cross section of the
retina shown in top image. Scale bars, 10 μm. (C) Number of rTRPV1-positive
(blue) photoreceptors per square millimeter in human retinas transduced with
both rTRPV1 and nanorods or in control, untransduced human retinas. Each data
point is collected from a different region of a retina (five regions per retina).
(D) Distribution of rTRPV1-positive cells across retinal layers. (E) Scanning
electron microscopy image of gold nanorods bound to a rTRPV1-transduced human
retina. Scale bar, 1 μm. Inset: Transmission electron microscopy image showing
an anti-6x-His antibody touching a gold nanorod with its Fc domain. Scale bar,
20 nm. (F) Example calcium responses recorded from human retinal neurons in
ONL, INL, and GCL in response to full-field NIR light. Vertical lines at bottom
indicate stimulus (915 nm, 100 ms) timing. Two-photon images of OGB-1 filled
cell bodies (white circles), left of the response curves. Scale bar, 10 μm.
(G) Cumulative frequency of responding neurons in different layers of human
retinas transduced with both rTRPV1 and nanorods (λ_abs = 915 nm) or in
control, untransduced human retinas. (H) Response amplitudes (ΔF/F) by retinal
layer. Increase (↑) and decrease (↓) of calcium signal are shown separately.
Number of neurons given as ratio of responding neurons to measured neurons.
Log10 light intensity = 18.9.

alignment (14) between human and snake se-
quences. OLLAS was placed after amino acid
755 or 758, corresponding to the first loop, or 
after amino acid 824, corresponding to the 
second loop (fig. S8). 

We performed whole-cell voltage clamp 
in HEK cells expressing TRPA1.755-OLLAS, 
TRPA1.758-OLLAS, TRPA1.824-OLLAS, or un-
tagged TRPA1 while activating the channels
by TRPA1 agonist allyl isothiocyanate. The
sizes of evoked currents were similar between
TRPA1.755-OLLAS and TRPA1 (table S1, row K)
and between TRPA1.824-OLLAS and TRPA1
(table S1, row L, and fig. S9). Currents were
undetectable for TRPA1.758-OLLAS (fig. S9B).
We used TRPA1.755-OLLAS (abbreviated as
sTRPA1) in subsequent experiments.

We targeted sTRPA1 to cone photoreceptors
of rd1 mice using the same AAV-based ap-
proach that was used for rTRPV1 (Fig. 3). To
activate the channel, nanorods (λ_abs = 915 nm)
were conjugated to anti-OLLAS antibodies.
Cell membrane expression of sTRPA1 was
seen in 50 ± 13% of rd1 cone photoreceptors
(table S1, row M, and Fig. 3C). Cones made
up 99 ± 0.8% of OLLAS-positive cells (table S1,
row N).

To compare rTRPV1 and sTRPA1 sensitiv-
ities, we performed behavioral tests in NIR
sensor-injected P56-P73 rd1 mice. NIR light of 
two different intensities cued delayed water 
appearance for water-restricted, head-fixed 
animals (Fig. 3D). We evaluated anticipatory 
lick rates, defined as lick signal after a NIR 
light flash but before the appearance of water. 
To measure whether NIR light affects the be- 
havior of wild-type animals, we trained wild- 
type mice with NIR (915 or 980 nm) and/or 
visible light (fig. S10A). NIR sensor-injected
mice learned to associate NIR light with water 
within 4 days. At the lower NIR intensity, 
anticipatory lick rates were similar between 
control mice and mice with rTRPV1 (table
S1, row O) but higher for mice injected with 
sTRPA1 (table S1, row P, and Fig. 3, E and F).
At the higher NIR intensity, rTRPV1 led to
higher lick rates compared with control mice
(table S1, row Q), but lower than with sTRPA1
(table S1, row R, and Fig. 3, E and F). Be- 
havioral performance of rTRPV1 and sTRPA1 
mice was similar to that of wild-type mice 
trained for 4 days using visible light (table S1, 
rows S and T, and fig. S10). NIR light neither 
elicited behavioral responses in wild-type mice 
nor affected wild-type behavioral responses to 
visible light (fig. S10C). 
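
The anticipatory lick rate defined above (licks after the NIR flash but before water appears) can be sketched as a simple windowed count. This is an illustration only: the function name, the use of lick timestamps in seconds, and the example timings are assumptions and are not taken from the study's analysis.

```python
def anticipatory_lick_rate(lick_times_s, stimulus_time_s, water_time_s):
    """Licks per second in the window [stimulus, water).

    lick_times_s: iterable of lick timestamps in seconds (hypothetical input).
    stimulus_time_s: onset of the NIR light flash.
    water_time_s: time at which water appears (must follow the flash).
    """
    if water_time_s <= stimulus_time_s:
        raise ValueError("water must appear after the stimulus")
    window_s = water_time_s - stimulus_time_s
    # Count only licks that fall after the flash and before water delivery.
    n_licks = sum(1 for t in lick_times_s if stimulus_time_s <= t < water_time_s)
    return n_licks / window_s
```

For example, with a flash at 0.5 s, water at 1.0 s, and licks at 0.2, 0.6, 0.9, and 1.4 s, two licks fall inside the 0.5-s window, giving 4 licks per second.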

To test safety aspects of inducing NIR light 
sensitivity, we first evaluated the effect of pro- 
longed NIR light exposure on wild-type retinas 
by immunostaining. NIR light neither acti- 
vated microglia nor reduced retinal layer thick- 
ness, opsin density, or cone density (fig. S11). 
Second, we tested nanorod biocompatibil-
ity with the rd1 retina 80 and 100 days after
subretinal injection by immunostaining. Nano-
rods neither activated microglia, increased
apoptosis, nor reduced retinal layer thickness 
(fig. S12). 

Finally, we sought to induce NIR light sen- 
sitivity in blind human retinas (Fig. 4). We tar- 
geted rTRPV1 to adult human ex vivo retinal 
explants, in culture for 8 weeks postmortem 
(Fig. 4B and fig. S13). Retinas lose normal 
light-evoked activity within 24 hours of iso- 
lation (15). Using AAV delivery and a CAG 
promoter, we transduced 2477 ± 889 photo-
receptors per square millimeter of human ret-
ina (mean ± SD, n = 3 explants) with rTRPV1
(Fig. 4C). Of the rTRPV1-positive cells, 94.5 ±
4.2% were photoreceptors (table S1, row U,
and Fig. 4D). To measure whether NIR light 
drives responses in the human retina, we de-
posited nanorods (λ_abs = 915 nm) over the
photoreceptor side. To record calcium signals, 
we used the fluorescent calcium dye OGB-1 
(16, 17). We then performed two-photon cal- 
cium imaging of individual neurons in the 
outer nuclear layer (ONL), inner nuclear layer 
(INL), and GCL (Fig. 4A). We observed NIR 
light-induced activation of different human 
retinal cell classes (Fig. 4, F to H). Most photo- 
receptors (73%) showed NIR light-evoked 
increases of calcium signal (Fig. 4H). Some 
photoreceptors (27%) showed decreases of 
calcium signal, likely reflecting horizontal 
cell feedback to NIR light-insensitive photo- 
receptors (Fig. 4H). In neurons of the INL 
and GCL, we observed both increases and 
decreases in calcium signal, indicating acti- 
vation of excitatory and inhibitory retinal 
pathways (Fig. 4H). More cells responded in 
the GCL than in the ONL, reflecting conver- 
gent retinal circuit organization (Fig. 4G). 
Sizes of light-evoked calcium responses were 
comparable to those in published reports 
(16, 17). 

Here, we described an approach to enable 
NIR light sensitivity in blind retinas, designed 
to be compatible with remaining vision (see 
supplementary text). We used gold nanorods 
coupled to temperature-sensitive engineered 
TRP channels to induce NIR light sensitivity 
in remaining photoreceptor cell bodies of blind 
mice and in ex vivo human retinas. In mice, 
NIR light-sensitized photoreceptors activated 
cortical visual circuits and enabled behavioral 
responses. By means of distinct nanorods, epi- 
tope tags, and TRP channel types, we tuned 
NIR responses to different wavelengths and to 
different radiant powers. In the human ret- 
ina, we reactivated light responses in photo- 
receptors and their retinal circuits 8 weeks 
postmortem. Our recordings of NIR light-evoked 
activity in the postmortem human retina pro- 
vide not only proof-of-principle for translation 
but also a model with which the function of 
human retinal cell types and circuits can be 
studied. 
TOPOLOGICAL OPTICS


A fractional corner anomaly reveals 


higher-order topology 


Christopher W. Peterson, Tianhe Li, Wladimir A. Benalcazar, Taylor L. Hughes, Gaurav Bahl*


Spectral measurements of boundary-localized topological modes are commonly used to identify 
topological insulators. For high-order insulators, these modes appear at boundaries of higher 
codimension, such as the corners of a two-dimensional material. Unfortunately, this spectroscopic 
approach is only viable if the energies of the topological modes lie within the bulk bandgap, which 

is not required for many topological crystalline insulators. The key topological feature in these insulators 
is instead fractional charge density arising from filled bulk bands, but measurements of such charge 
distributions have not been accessible to date. We experimentally measure boundary-localized fractional 
charge density in rotationally symmetric two-dimensional metamaterials and find one-fourth and 
one-third fractionalization. We then introduce a topological indicator that allows for the unambiguous 
identification of higher-order topology, even without in-gap states, and we demonstrate the associated 


higher-order bulk-boundary correspondence. 


Topological insulators (TIs) are materials
with a gapped band structure charac- 
terized by quantized quantities, called 
topological invariants, that are invariant 
under deformations that preserve both 
the bulk bandgap and any protective symme-
tries (1, 2). At a boundary between two mate-
rials that have different strong topological
invariants—i.e., where a topological invariant 
changes in space—the bandgap closes, and ro- 
bust boundary-localized gapless modes appear. 
Detection of these robust gapless boundary 
modes is therefore one of the most striking 
signatures of topological materials. 

We focus on two-dimensional (2D) TIs in 
class AI (spinless and time-reversal symmet- 
ric) (3). In this class and dimension, no non- 
trivial strong topological invariants exist (i.e., 
those protected by particle-hole, chiral, and/or 
time-reversal symmetry, such as the $\mathbb{Z}_2$ in-
variant for a quantum spin Hall insulator in
class AII), but invariants can be defined if 
additional spatial symmetries are present. 
Materials with invariants protected by spatial 
symmetries are known as topological crystal- 
line insulators (TCIs) (4, 5). We are specifical- 
ly interested in a recently discovered class of 
TCIs whose members have gapped boundaries 
of codimension one but host gapless modes 
at boundaries with codimension greater than 
one, i.e., at a boundary of a boundary (6-9). Be- 
cause these insulators manifest robust gapless 
modes at boundaries with higher codimension, 
they have been termed higher-order topological 
insulators (HOTIs). We note that spatial sym-
metries are essential for these HOTIs because
they prevent bulk and surface deformations 
that hybridize and gap out the set of higher- 
order gapless states. 

Only a few naturally occurring HOTIs have 
been identified (8, 9). Instead, much of the 
experimental study of HOTIs (primarily d-th 
order TCIs in d dimensions) has been per- 
formed in engineered metamaterials, such 
as networks of coupled resonators (10-16), 
waveguide arrays (17, 18), and photonic or 
sonic crystals (19-23). So far, the clearest in- 
dicator of higher-order topology in such sys- 
tems has been the spectroscopic measurement 
of robust localized corner modes with energies 
inside the bulk bandgap of 2D (10-14, 17-23) 
and 3D (15, 16) HOTIs. 

However, there is a fundamental problem 
with using localized in-gap boundary modes to 
identify higher-order topology (or, generally, 
topology protected by spatial symmetries). Spa- 
tial symmetries essentially divide a material 
into symmetric sectors and require that local- 
ized modes in each sector are identical. Hence, 
these symmetries protect the degeneracy of 
boundary-localized modes but do not restrict 
their energy (24). Additional local symme- 
tries (e.g., chiral symmetry or particle-hole 
symmetry) can pin the boundary modes to zero 
energy (midgap) (24, 25), but these symmetries 
are not actually necessary to protect the higher- 
order topology, and many lattice models do 
not support their implementation at all. This 
implies that the energy of localized bound- 
ary modes may reside either in the bulk gap 
or fully within the bulk bands of a HOTI, 
depending on the material’s details. TIs that 
fall into the latter case do not host gapless 
boundary modes within their bulk bandgap 
and, as such, cannot be distinguished from 
trivial insulators by their spectrum alone, even 
with fully open boundary conditions. 
This fundamental principle means that
HOTIs could be misidentified when their 
spectra do not exhibit in-gap modes, and it 
motivates the search for an experimentally 
measurable indicator of higher-order topology 
that is protected by only spatial symmetries. It 
has previously been established that spatial 
symmetries protect boundary-localized, quan- 
tized fractional charge in TCIs (6, 24, 26-29). 
In this work, we demonstrate that a similar 
feature in metamaterials—namely, the mode 
density of the spectral bands—can also be 
fractionally quantized and can diagnose both 
first-order and higher-order topology in gapped 
TCIs. In two dimensions, we term the quantity 
indicating second-order topology as a fraction- 
al corner anomaly (FCA) in the bulk mode 
density. 

We define mode density as the local den- 
sity of states (DOS) integrated over an entire 
band, which is equivalent to the charge den- 
sity of a filled band in an electronic insula- 
tor. Unlike charge density, using mode density 
enables us to study the topology of bands 
without regard for electronic filling or con- 
straints imposed by charge neutrality. In 2D 
TCIs, first-order nontrivial topology manifests
as an edge-localized fractional mode density
$\sigma$. For TCIs with only first-order topology,
the corner-localized fractional mode density
$\rho$ is the sum of the fractional mode
densities, $\sigma_1$ and $\sigma_2$, that respectively
manifest at the edges that intersect to form that
corner, such that $\rho = \sigma_1 + \sigma_2 \bmod 1$,
where mod 1 denotes the modulo-1 operation (30).
A fractional quantized deviation from this value
definitively indicates higher-order topology,
such that 2D TCIs without this deviation are not
higher-order. To measure this type of higher-order
feature in the bulk mode density, we define
the FCA $\phi$, where

$$\phi = \rho - (\sigma_1 + \sigma_2) \bmod 1 \qquad (1)$$


to capture second-order topology in 2D TCIs. 
A detailed motivation for the FCA and a gen- 
eralized proof of this definition are provided 
in the supplementary materials (30). 
Notably, although a nonzero FCA does not 
indicate that corner modes lie within the 
bulk bandgap, it does indicate the existence of 
robust topological corner modes somewhere 
in the spectrum—i.e., either within the bulk 
bands (or edge bands) or the bandgap. When 
topological corner modes are not spectrally 
isolated, the corner modes can generally cou- 
ple to, and hybridize with, bulk or edge modes, 
although it was recently shown that in some 
cases corner modes within a bulk band can 
act as bound states in the continuum (31).
However, when spectrally isolated from both 
the bulk and edge modes, they form the fa-
miliar exponentially localized 0D in-gap
corner modes (10, 12, 13, 15, 16, 20). In the 
Fig. 1. Fabricated metamaterials and measured spectra. (A) Photograph of the experimental resonator array with C4 symmetry. The schematic on the
right illustrates the coupling between resonators. (B) Photograph of the experimental resonator array with C3 symmetry. The schematic on the right illustrates
the coupling between resonators. (C) Measured DOS spectrum for the resonator array in (A). arb. units, arbitrary units. (D) Measured DOS spectrum
for the resonator array in (B).


next section, we experimentally measure a 
nonzero FCA in insulators where the corner 
modes are hybridized with bulk modes. Sim- 
ulation results, detailed in the supplemen- 
tary materials (30), show that the energy 
of topological corner modes can be tuned 
into, and even fully across, the bulk band- 
gaps (and any edge bandgaps) when a local- 
ized potential is applied to only the corner 
unit cells. We then demonstrate that these 
corner modes can be spectrally isolated and 
exponentially localized by deformation of 
the corner unit cell. 

To observe the FCA experimentally, we con- 
structed two rotationally symmetric TI meta- 
materials in microwave-frequency coupled 
resonator arrays. We chose to test two insu- 
lators with different symmetries because the 
quantization of the fractional mode density 
and FCA depends on the rotation symmetry 
group (24). The first insulator, shown in Fig. 1A, 
is on a square lattice with C4 symmetry, and
the second insulator, shown in Fig. 1B, is on
a kagome lattice with C3 symmetry. We first
found the spectral DOS of both metamaterials 
by means of reflection measurements; see the 
supplementary materials (30) for details of 
the measurement technique. The measured 
spectrum of the C4-symmetric insulator, shown
in Fig. 1C, has three distinct bands. The mea- 
sured spectrum of the C3-symmetric insulator, 
shown in Fig. 1D, has two bands. Neither of
these insulators has in-gap modes, so from the
spectra alone it is not possible to tell whether
either metamaterial is topologically nontrivial. 
However, as we will show, both are in fact 
nontrivial, and the intrinsic chiral symmetry 
breaking causes the edge and corner modes 
to lie within the bulk bands (30). 

Next, we calculate the mode density of the 
measured bands by integrating the local DOS 
in each unit cell over their respective frequency 
ranges, as shown for both insulators in Fig. 2. 
The mode density of the C4-symmetric insulator
is shown in Fig. 2A and has several important
features on which we will focus. First,
we find that the resonators in the bulk unit
cells are excited in all three bands, which indicates
that this insulator nominally has three
bulk bands. We observe that the total mode
density of these bands in each sector is approximately
equal, which demonstrates that
this insulator has approximate C4 rotation
symmetry with a small amount of symmetry-breaking
disorder from fabrication imperfections.
As expected, the mode density in the
bulk unit cells, designated by $\mu^{(4)}$ (where the
superscript indicates C4 symmetry), is always
an integer. Specifically, $\mu^{(4)}_{1,3} \approx 1$ (where the
subscript indicates the band number) and $\mu^{(4)}_2 \approx 2$,
which indicates that band 2 is a twofold
degenerate band whereas bands 1 and 3 are
nondegenerate.
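
The band integration described above can be sketched numerically. The snippet below is a minimal illustration, assuming the local DOS of each resonator is available as samples on a frequency grid and has already been normalized so that a single mode integrates to 1; the function names, the trapezoidal rule, and the band-edge arguments are assumptions for illustration and are not taken from the measurement procedure in the supplementary materials.

```python
def mode_density(freqs, ldos, f_lo, f_hi):
    """Trapezoidal integral of one resonator's local DOS over a band.

    freqs: increasing frequency samples; ldos: local DOS at those samples.
    Only samples with f_lo <= f <= f_hi contribute (hypothetical band edges).
    """
    pts = [(f, d) for f, d in zip(freqs, ldos) if f_lo <= f <= f_hi]
    return sum(0.5 * (d0 + d1) * (f1 - f0)
               for (f0, d0), (f1, d1) in zip(pts, pts[1:]))


def unit_cell_mode_density(freqs, ldos_per_resonator, f_lo, f_hi):
    """Mode density of a unit cell: sum over the resonators it contains."""
    return sum(mode_density(freqs, ldos, f_lo, f_hi)
               for ldos in ldos_per_resonator)
```

With this normalization, bulk unit cells should integrate to values close to integers, and the fractional part that remains in edge and corner unit cells is the quantity of interest.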

Most importantly, we find a nonzero frac- 
tional mode density in the edge and corner 
unit cells. Because the bandgaps are relatively 
large compared with the width of the bands
(i.e., the system has a very short correlation
length), the entirety of the fractional mode
density is tightly localized within the boundary
unit cells. For bands 1 and 3, the fractional
mode density in the edge unit cells
is $\sigma^{(4)}_{1,3} \approx \tfrac{1}{2}$, and in the corner unit cells it is
$\rho^{(4)}_{1,3} \approx \tfrac{1}{4}$; in the twofold degenerate band 2,
these fractions are doubled. This doubling
occurs because band 2 is equivalent to two
copies of band 1, in the sense that the Wannier
representation of the orbitals in band 2 has
two Wannier centers at Wyckoff position b,
whereas band 1 has only one at position b
(30); each Wannier center contributes equal
fractional mode density to the boundary. The
approximate fractions written above are
obtained by rounding to the nearest quarter,
as we anticipate that C4 symmetry quantizes
mode density in fractions of one-fourth (24).
We can now extract the FCA for each bulk
band using the mode density data in Fig. 2A.
Because the system is near a zero-correlation-length
limit, we find $\phi$ using the simple formula
$\phi = \rho - 2\sigma$, where $\rho$ is the fractional
mode density of the corner unit cell and $\sigma$ is
the fractional mode density of the edge unit
cells (because of C4 symmetry, all edges are
expected to be identical). Here, because there
is a small amount of unavoidable disorder
in the experiment (which slightly breaks C4
symmetry), we average over all the edges to
find $\sigma$ and over all the corners to find $\rho$,
Fig. 2. Spatial distribution of bands. (A) Measured mode density for the C4-symmetric insulator. The mode density for each band is shown separately. Each filled circle
represents a resonator, with the area of the circle corresponding to the measured mode density of that resonator in that band. The total mode density of each unit cell 
is shown with black text, and the total mode density of each sector is shown in purple with white text. (B) Same as (A) but for the C3-symmetric insulator. 


Peterson et al., Science 368, 1114-1118 (2020)

such that $\phi^{(4)}_1$, i.e., the FCA for band 1 in the
C4-symmetric metamaterial, is

$$\phi^{(4)}_1 = \rho - 2\sigma = 0.23 - 0.98 \equiv 0.25 \approx \tfrac{1}{4} \pmod 1 \qquad (2)$$

A similar calculation can be carried out for
the other bulk bands, giving $\phi^{(4)}_2 = 0.55 \approx \tfrac{1}{2}$
for band 2 and $\phi^{(4)}_3 = 0.19 \approx \tfrac{1}{4}$ for band 3. The
sum of the FCA over all the bulk bands is always
an integer (here rounding gives
$\phi^{(4)}_1 + \phi^{(4)}_2 + \phi^{(4)}_3 = 0.99$), and $\phi^{(4)}_2$ is twice $\phi^{(4)}_1$, as
expected.
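
The arithmetic behind these FCA values can be reproduced with a short sketch. It evaluates the FCA from the averaged corner and edge mode densities and rounds to the nearest allowed fraction (quarters for C4, thirds for C3). The function names are illustrative assumptions, and the per-edge value of 0.49 is inferred from the reported 2σ = 0.98 rather than quoted directly.

```python
from fractions import Fraction


def fca(rho, sigma_1, sigma_2):
    """FCA phi = rho - (sigma_1 + sigma_2) mod 1, returned in [0, 1)."""
    return (rho - (sigma_1 + sigma_2)) % 1.0


def quantize(value, n):
    """Round to the nearest multiple of 1/n, taken modulo 1.

    n = 4 for the C4-symmetric lattice, n = 3 for the C3-symmetric lattice.
    """
    return Fraction(round(value * n) % n, n)


# Band 1 of the C4-symmetric metamaterial: rho = 0.23, sigma = 0.49 per edge.
phi_1 = fca(0.23, 0.49, 0.49)
print(round(phi_1, 2), quantize(phi_1, 4))
```

Applying `quantize` to the other reported values gives 1/2 for 0.55 and 1/4 for 0.19 in the C4 case, and the band sum of 0.99 rounds to an integer, consistent with the statement above.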

Next, we move on to the mode density for 
the C3-symmetric system, shown in Fig. 2B. 
For this material, we again find that the bulk
unit cells have integer mode density, $\mu^{(3)}_1 \approx 2$
(a twofold degenerate band) and $\mu^{(3)}_2 \approx 1$. As in
the previous system, the total mode density of
these bands in each sector is approximately
equal. The edge unit cells have a fractional
mode density of $\sigma^{(3)}_1 \approx \tfrac{2}{3}$ in band 1 and $\sigma^{(3)}_2 \approx \tfrac{1}{3}$
in band 2. Here, the approximate fractions
are obtained by rounding to the nearest third
because this system is C3 symmetric (24).
Notably, although this material does not have
a fractional mode density in the corner unit
cells, the FCA indicator is nonzero, $\phi^{(3)}_1 =
0.70 \approx \tfrac{2}{3}$, and $\phi^{(3)}_2 = 0.30 \approx \tfrac{1}{3}$. For comparison,
in the supplementary materials (30) we also
present experimental measurements of the
FCA for a trivial insulator and show that
$\phi^{(3)} = 0$ for all bands.
Fig. 3. Pulling topological modes into the gap. (A) The schematic on the left shows where a small, negative on-site potential is applied (white circles). The
measured DOS has modes within the bulk bandgap, which are spatially localized to the corners and confined to one sublattice. (B) Same as (A) but for the
C3-symmetric insulator.


The nonzero FCA in both metamaterials 
indicates that they are indeed HOTIs, and 
we have argued above that they should host 
second-order topological modes at their corners. 
Because we have not observed these expected 
topological modes within the bulk bandgap, 
we can estimate their spectral location by find- 
ing the band in which the corner resonators 
are most strongly excited. In the C4-symmetric
system, the corner resonators, around which 
the second-order topological modes are ex- 
pected to exist, are mainly excited in band 3, 
which indicates that the corner modes lie in 
this band. Moreover, this implies that we can 
spectrally localize these modes by slightly 
lowering the resonance frequency of the cor- 
ner resonators. As illustrated in Fig. 3A, we 
applied a small negative potential to the cor- 
ner resonators using a capacitor connected to 
ground, which decreases the electrical length 
(and thus the resonance frequency) of the cor- 
ner resonators. When the potential is applied, 
we observe topological modes moving into the 
bandgap between bands 2 and 3 and becom- 
ing exponentially localized to the corner and 
confined to one sublattice. 

In the C3-symmetric system, the corner reso- 
nators are only excited in band 2, which again 
indicates that the energy of the corner modes 
is too high and should be lowered to bring the 
modes into the bandgap. We pull these modes 
into the bandgap by similarly applying a small 
negative potential to the corners, as illustrated
in Fig. 3B. The topological modes are observed 
to spectrally localize within the bandgap and 
spatially localize to the corners with confine- 
ment on one sublattice. We also conducted a 
similar experiment on a trivial insulator (30) 
and found that localized corner modes cannot 
be isolated within the bulk bandgap, regard- 
less of the applied potential strength. 

The definition of the FCA can also be extended 
beyond two dimensions to identify d-th-order 
topology in fully gapped d-dimensional insu- 
lators. For example, the FCA for third-order 
TCIs in three dimensions is


$\phi = \sum (\delta - \rho_i - \sigma_j) \bmod 1$ (3)

where $\delta$ is corner-localized fractional mode
density, $\rho_i$ is hinge-localized fractional mode
density, and $\sigma_j$ is surface-localized fractional
mode density. Because this indicator captures
fundamental topological features that are pro- 
tected by spatial symmetries, we expect that it 
can assist with the experimental identifica- 
tion of materials with higher-order topology, 
which could otherwise be misidentified by 
only searching for in-gap corner modes. From 
a practical perspective, focusing on bulk- 
derived fractional mode density instead of 
localized modes could simplify experimental 
confirmations of novel TIs, which often use 
ad hoc supplementary boundary elements (e.g., 
auxiliary resonators or loading capacitors) to
spectrally shift topological modes into the 
bandgap (22-25, 28, 29, 31). 
Thresholds of mangrove survival under rapid sea level rise
The response of mangroves to high rates of relative sea level rise (RSLR) is poorly understood. We
explore the limits of mangrove vertical accretion to sustained periods of RSLR in the final stages 
of deglaciation. The timing of initiation and rate of mangrove vertical accretion were compared 
with independently modeled rates of RSLR for 78 locations. Mangrove forests expanded between 
9800 and 7500 years ago, vertically accreting thick sequences of organic sediments at a rate 
principally driven by the rate of RSLR, representing an important carbon sink. We found it very likely 
(>90% probability) that mangroves were unable to initiate sustained accretion when RSLR rates 
exceeded 6.1 millimeters per year. This threshold is likely to be surpassed on tropical coastlines
within 30 years under high-emissions scenarios.


The rate of relative sea level rise (RSLR) in
tropical and subtropical locations is pro-
jected to accelerate from current trends
of ~3.4 mm year⁻¹ to a mean estimate of
~5 mm year⁻¹ under low-emissions sce-
narios and ~10 mm year⁻¹ under high-emissions
scenarios by 2100 (1, 2). Modeling of feed-
backs between RSLR, vertical accretion, root
mass formation, and tidal marsh and man-
grove vulnerability under sustained high rates
of RSLR is vital for the survival of these eco-
logically and economically important coastal
environments (2). Some tidal marshes have
been projected to survive RSLR of more than
10 mm year⁻¹ where supported by high avail-
able suspended sediment concentrations (3),
on the basis of accretion data that span an-
nual to decadal time scales. Reconstructions
using paleoenvironmental proxies, however,
have suggested that UK marshes were vul-
nerable to retreat at RSLR of 7 mm year⁻¹ (4).
Mangroves grow in sheltered intertidal en- 
vironments that are exposed to the effects of 
RSLR (5). They support among the highest 
rates of carbon burial of all ecosystems (6), 
and a growing body of evidence suggests that 
this efficiency is enhanced by RSLR (7). How- 
ever, empirical data on mangrove response to 
high rates of RSLR are lacking, given the lim- 
ited observation period (1 to 16 years) of real- 
time measurements (5). 
Here, we assess the mangrove response to 
RSLR from the paleorecord of mangrove ver- 
tical accretion preserved in the sedimentary 
archives of continental shelves and coastal
lowlands. Mangrove forests have established 
and drowned in association with variability in 
the rate of RSLR after the onset of deglacia- 
tion ~26,000 to 20,000 years ago (8). Before the 
Holocene, long-term RSLR rates of >12 mm
year⁻¹ (8) exceeded the capacity of mangroves
to maintain position in situ through vertical
accretion, and mangroves were displaced land-
ward (9), with few exceptions (10, 11). During
the early to mid-Holocene, RSLR slowed in 
association with the final phase of deglaciation 
of the Laurentide ice sheet (12). The rate of
RSLR varied across the globe owing to glacio- 
isostatic adjustment, the response of the solid 
Earth and gravity field to ice mass redistri- 
bution during a glacial cycle (13). Far-field sites, 
those distal from regions of former ice sheet 
extent and incorporating most of the tropics, 
exhibit a decline in the rate of RSLR (14) from
10,000 to 7000 years ago (phase 1 in Fig. 1A) to a
mid-Holocene highstand, followed by stable or 
falling sea level to near the current position 
from ~6000 years ago to the present (phase 2 
in Fig. 1A). By contrast, at intermediate-field 
sites closer to centers of glaciation (for example, 
the Gulf of Mexico and the Caribbean), sea level 
rose continuously throughout the Holocene 
owing to isostatic response to melting of the 
Laurentide ice sheet. These coastlines have 
experienced a decelerating rate of RSLR from 
10,000 years ago to the present (Fig. 1B and 
figs. S5 and S6). 

The spatially variable deceleration of RSLR 
from 10,000 to 8000 years ago across tropical 
and subtropical latitudes coincided with the 
initiation of vertically continuous, organic-rich 
mangrove sediments several meters thick, as 
rising seas flooded shallow continental shelves 
(phase 1 in Fig. 1, A and B) (15). This deceleration 
provides an opportunity to explore whether (i) 
the rate of mangrove vertical accretion responds 
to changes in RSLR; (ii) ice sheet proximity 
(intermediate versus far field), geomorphic set- 
ting, or tidal range constrains the capacity of man-
groves to accrete in relation to RSLR; (iii) upper
thresholds of mangrove vertical accretion can be 
detected; and (iv) mangrove development and 
vertical accretion correspond in timing to changes 
in the global atmospheric carbon budget. 

We present empirical data from 122 recon- 
structions of the timing and rate of mangrove 
vertical accretion associated with Holocene 
RSLR in cores collected from 78 tropical and 
subtropical locations [Fig. 2; (14)]. We inde- 
pendently estimate rates of RSLR for each of 
the 78 locations before and for the duration 
of mangrove vertical accretion from a glacio- 
isostatic adjustment model using an ensem-
ble of different Earth parameters (16) (Fig. 3
and figs. S5 and S6). 

Slowing RSLR during the early to mid- 
Holocene coincided with the initiation of ex- 
tensive mangrove forests (Figs. 1A, phase 1, and 
3, A and B). Our analysis suggests that sustained 
mangrove vertical accretion began across far- 
field regions (Africa, Asia, Australasia, and South 
America) at ~10,000 to 8000 years ago and 
intermediate-field regions (Caribbean and Gulf 
of Mexico) at ~8000 to 6000 years ago (Fig. 3 and 
figs. S1 and S2). Data were discriminated on
the basis of ice sheet proximity and geomor-
phic setting and differentiated by tidal regime
to explore differences in the timing of initia-
tion and rates of vertical accretion (14). The
intermediate- and far-field classifications, de- 
fined as a function of a location’s proximity to 
areas of major ice sheet retreat during the last 
deglaciation, act as a surrogate variable for the 
temporal pattern of RSLR rates through the 
Holocene, which in turn influences the avail- 
ability of accommodation space within which 
mangroves can accrete vertically. In a general- 
ized linear model of several variables, only the 
proximity to former ice sheets proved to have a 
significant relationship with accretion rates (14).

We found a strong relationship (p < 0.001, 
generalized linear model) between rates of 
mangrove vertical accretion and RSLR rate 
across all sites (51% of the variation in the 
accretion rate can be explained by the RSLR 
variation; Fig. 3). Mangrove vertical accretion 
first initiated ~9800 years ago in the Ganges- 
Brahmaputra River Delta, as RSLR decreased 
from ~9 to ~6 mm year⁻¹, and continued for
~650 years at high rates until replaced by
subtidal deposits as RSLR increased again to
>7 mm year⁻¹ (14, 17). Elsewhere in far-field
locations, mangrove vertical accretion initiated
as RSLR decreased below ~7 mm year⁻¹ starting
8800 years ago (Fig. 3). Rates of vertical ac-
cretion of >6 mm year⁻¹ were sustained for
more than 1000 years in mangrove forests in 
Australia, Africa, South America, Central America, 
and Asia (Fig. 2 and data S1), irrespective of 
geomorphic setting (fig. S1). 
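
As a rough illustration of how an accretion rate follows from two dated horizons in a core (the approach summarized in the Fig. 3 caption), the sketch below divides the thickness of mangrove sediment by the time elapsed between two calibrated ages. The function name and the example depths and ages are hypothetical, chosen only to show the arithmetic; they are not values from data S1.

```python
def accretion_rate_mm_per_year(depth_top_m: float, depth_base_m: float,
                               age_top_yr: float, age_base_yr: float) -> float:
    """Mean vertical accretion rate between two dated horizons.

    Depths are metres below the core top; ages are calibrated years
    before present, so the deeper sample must also be the older one.
    """
    thickness_mm = (depth_base_m - depth_top_m) * 1000.0
    elapsed_yr = age_base_yr - age_top_yr
    if thickness_mm <= 0 or elapsed_yr <= 0:
        raise ValueError("base sample must be deeper and older than top sample")
    return thickness_mm / elapsed_yr

# Hypothetical core: 6.5 m of mangrove sediment deposited over 1000 years.
rate = accretion_rate_mm_per_year(2.0, 8.5, 7500.0, 8500.0)
print(f"{rate:.1f} mm/year")
```

A rate computed this way is an average over the dated interval, so short-lived departures from it cannot be resolved.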

Mangrove vertical accretion initiated in 
intermediate-field regions later than in far-field 
locations, first in Belize ~7800 years ago as 
Fig. 1. Holocene RSLR and associated mangrove development in far-field estuarine and intermediate-field calcareous settings. (A) Far-field estuarine
settings. (B) Intermediate-field calcareous settings. As the rate of RSLR decelerated, development of organic-rich mangrove sediments was initiated, which facilitated
maintenance of vertical position with respect to sea level, forming extensive mangrove forests. When RSLR stabilizes, ongoing sedimentation infills estuaries, building
deltas and replacing mangrove with freshwater wetlands, terrestrial forest, or saltpan, depending on climatic setting. CI, credible interval.


Fig. 2. Sampling locations and corresponding 
rates of mangrove vertical accretion. (A and 
B) Mangrove sedimentary units initiating between 
10,000 and 6500 years ago (A) and mangrove 
sedimentary units initiating after 6500 years ago 
(B). Contemporary mangrove shorelines are indi- 
cated in bright green, representing mangrove 
distribution in 2012 (30). 
Fig. 3. Rates of mangrove vertical accretion and RSLR. (A and B) Accretion rates are derived from the depth of mangrove organic sediment between calibrated ¹⁴C dates in
individual cores, and rates of RSLR (median and 95% CI) were derived from the glacio-isostatic adjustment (GIA) model ensemble for the same sites for far-field (A) and
intermediate-field (B) locations. (C) The probability of initiation of sustained mangrove accretion at or above associated rates of RSLR across all sites. The dotted line shows the
RSLR rate at which there is a 10% probability that mangroves can initiate accretion and/or, conversely, that there is a 90% probability that they are unable to initiate accretion.

Table 1. Probability that mangroves are unable to initiate sustained vertical accretion at rates of RSLR. See (14). RSLR rates at all sites (global) and
intermediate- and far-field sites at which it is very likely (>90% probability) and extremely likely (>95% probability) that mangroves are unable to initiate
sustained accretion and the associated 95% uncertainty interval (UI). For example, at all sites, there is a 94.3 to 95.4% probability (95% UI) that mangroves
are unable to initiate at rates that exceed 7.6 mm year⁻¹.


(a) RSLR rates above which accretion did not initiate (very likely: 90% median probability)
(b) RSLR rates above which accretion did not initiate (extremely likely: 95% median probability)

Sites                (a) RSLR rate (mm year⁻¹)   (a) Uncertainty interval (%)   (b) RSLR rate (mm year⁻¹)   (b) Uncertainty interval (%)
Global               6.1                                                        7.6                         (94.3-95.4)
Intermediate-field                               (88.1-92.8)                                                (93.7-96.9)
Far-field            7.1                         (87.3-91.6)                    8.8                         (94.4-95.8)


RSLR fell below ~5 mm year⁻¹ and in other
Caribbean and Gulf of Mexico sites between
7500 and 6000 years ago (Figs. 2 and 3 and
table S1). This later initiation, in comparison
to far-field regions, may be related to the lim-
ited allochthonous sediment supply availa-
ble in the carbonate (reef) settings, although
in Belize, rates of accretion of more than 6 mm
year⁻¹ were observed at two sites (data S1),
driven by strong autochthonous inputs.

We used a Bayesian framework to estimate 
the probability of initiation of mangrove ac- 
cretion conditional on rates of RSLR within 
the Holocene dataset (14). The empirical Holo-
cene data (data S1), which span a wide range
of geomorphic settings and tidal regimes (fig.
S2), suggest that when RSLR rates exceed 6.1 and
7.6 mm year⁻¹, respectively, mangroves are very
likely (>90% probability) and extremely likely 
(>95% probability) to be unable to initiate sus- 
tained accretion. We found lower RSLR thresh- 
olds for intermediate-field sites and higher 
thresholds for far-field sites (Table 1). 

Our database also reveals spatial variability 
in the duration of mangrove accretion (figs. 
S1 and S2). In only 3 of 50 far-field locations
was there indication of drowning of man- 
groves during a marine transgression (i.e., man- 
grove sediments overlain by tidal or subtidal 
deposits; data S1). In most far-field locations 
(30 of 50), accretion and progradation of the
fluvial delta led to the replacement of man- 
grove by saltmarsh (characteristic of upper 
intertidal elevations), freshwater wetland, or 
terrestrial forest, often by the mid-Holocene 
when relative sea level stabilized (phase 2 in 
Fig. 1A and data S1). For this reason, contem- 
porary mangrove extent is highly contracted 
compared with the early- and mid-Holocene 
mangrove development in many major river 
deltas such as the Ord River, Australia; the 
Red River, Vietnam; and the Mekong River, 
Vietnam and Cambodia (15, 18-20). Man- 
grove accretion persisted significantly longer 
at intermediate-field locations (fig. S2) be- 
cause accommodation space was enhanced 
by glacio-isostatic adjustment. 

The extensive development of mangrove 
environments under RSLR has exerted an in- 
fluence on global carbon cycles over geologic 
time scales (21), including, we suggest, during 
the early to mid-Holocene (Fig. 4). Carbon 
balance modeling based on stable carbon 
isotope signatures suggests that the 5 parts per 
million by volume reduction in atmospher-
ic CO₂ in the early Holocene was driven by
increases in the uptake of carbon by the land 
biosphere on the order of 290 Pg C (22, 23) 
before 7000 years ago. The timing and volume 
of this uptake have been attributed to the north- 
ward expansion of boreal vegetation after ice 
sheet retreat [~110 Pg C (24)] and organic soil 
development [180 Pg C (23)], of which circum- 
Arctic peatlands may have sequestered 20 to 
60 Pg C based on the depth of peat formation 
at the time (24). We conservatively estimate a 
somewhat larger contribution (~85 Pg C) from 
mangrove carbon sequestration and burial 
over the period 8600 to 6000 years ago (14). 
The extensive development of mangrove for- 
ests over this period largely replaced methan- 
ogenic environments {freshwater wetlands 
and floodplains [data S1; (25)]} and corre- 
sponds to declining rates of methane emis- 
sion, particularly in the Southern Hemisphere 
(26). The δ¹³CH₄ signals in the Southern Hemi-
sphere for this period show a 1.5 per mil deple-
tion consistent with a replacement of vegetation
using the C₄ photosynthetic pathway (tropical
grasslands and saltmarsh adapted to low at-
mospheric CO₂), with mangroves using the
C₃ pathway (26, 27).
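
The carbon figures quoted in this paragraph can be checked with simple arithmetic. The snippet below only restates the numbers given in the text (the variable names are ours); it is a bookkeeping aid under that assumption, not a carbon-cycle model, and it does not imply that the mangrove estimate is part of the 290 Pg C total.

```python
# Early Holocene carbon figures, in Pg C, as quoted in the text.
total_land_uptake = 290          # land biosphere uptake (22, 23)
boreal_expansion = 110           # approximate, northward boreal expansion (24)
organic_soil_development = 180   # organic soil development (23)
peatland_range = (20, 60)        # circum-Arctic peatlands, part of organic soils (24)
mangrove_estimate = 85           # approximate, 8600 to 6000 years ago (14)

# The two attributed components add up to the quoted total.
assert boreal_expansion + organic_soil_development == total_land_uptake

# The mangrove estimate exceeds the upper end of the peatland range.
excess_over_peat = mangrove_estimate - peatland_range[1]
print(excess_over_peat)  # 25
```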

As RSLR increases in the 21st century, an in- 
crease in the rate of mangrove vertical accretion, 
coupled with landward expansion as sea level 
rises, can be expected to drive increases in the 
rate of carbon sequestration and preservation 
in mangrove environments, providing a nega- 
tive feedback on radiative forcing, as suggested 
more broadly for coastal wetlands (7). Although 
Fig. 4. Timing of mangrove vertical accretion in relation to greenhouse gas concentrations. (A and B) Atmospheric CO₂ concentrations [from Antarctic ice cores (23)] (A) and mangrove organic soil for-
our results demonstrate that accretion in man-
groves increases in response to RSLR, we found
that it was very likely (>90% probability) that 
mangroves were unable to initiate sustained 
accretion when RSLR rates exceeded 6.1 mm
year⁻¹ in any but the most sediment-laden set-
tings. RSLR is projected to remain below 5 mm
year⁻¹ under low-emissions scenarios [Repre-
sentative Concentration Pathway (RCP) 2.6]
throughout the 21st century (7). However, RSLR
is expected to exceed 5 mm year⁻¹ by 2030 and
7 mm year⁻¹ by 2050 under high-emissions sce-
nario RCP8.5 in low-latitude mangrove set-
tings where rates of RSLR are expected to be
higher than the global average (1, 2).
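
To make the comparison between projected RSLR and the paleo-derived thresholds concrete, the following sketch classifies an RSLR rate against the 6.1 and 7.6 mm per year thresholds reported above. The function name and labels are illustrative, and the projected rates are the round figures quoted in the text treated as point values, which is a simplification.

```python
VERY_LIKELY_MM_YR = 6.1       # >90% probability of no sustained initiation
EXTREMELY_LIKELY_MM_YR = 7.6  # >95% probability of no sustained initiation

def initiation_outlook(rslr_mm_per_year: float) -> str:
    """Label an RSLR rate relative to the two global thresholds."""
    if rslr_mm_per_year > EXTREMELY_LIKELY_MM_YR:
        return "extremely likely unable to initiate"
    if rslr_mm_per_year > VERY_LIKELY_MM_YR:
        return "very likely unable to initiate"
    return "below both thresholds"

# Round RSLR figures quoted in the text, used as illustrative point values.
scenarios = {
    "RCP8.5, 2030": 5.0,
    "RCP8.5, 2050": 7.0,
    "high emissions, 2100": 10.0,
}
for label, rate in scenarios.items():
    print(f"{label}: {rate} mm/year -> {initiation_outlook(rate)}")
```

The comparisons are strict, so a rate exactly equal to a threshold is not counted as exceeding it.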

Where a deficit commences between verti- 
cal accretion and RSLR, time to submergence 
will be a function of the position of the man- 
grove within the tidal frame. In settings of low 
tidal range, mangroves are more likely to be 
situated at elevations close to the threshold of 
submergence from the outset. In settings of 
high tidal range, mangroves are more likely 
to be situated at elevations well above this 
threshold and tolerate a deficit between the 
rates of accretion and RSLR for decades to 
in tide- and river-dominated estuaries may
provide an elevation subsidy not available in 
environments receiving low sediment supply, 
such as coral reefs. In this context, sediment 
retention in catchments affected by water 
resource development (i.e., trapped behind 
dams) and local sediment controls may de- 
crease mangrove resilience to RSLR in river 
estuaries (5). The natural response of man- 
grove encroachment across flooded coastal 
lowlands is therefore the main determinant 
of future extent (28), although this is already 
greatly impeded by coastal developments along 
many coastlines (29). Our findings therefore 
emphasize the importance of (i) mitigating the 
magnitude of rapid RSLR and (ii) ensuring that 
coastal adaptation measures allow for the ex- 
pansion of mangrove across coastal lowlands. 
PIRs mediate innate myeloid cell memory to nonself MHC molecules
Immunological memory specific to previously encountered antigens is a cardinal feature of adaptive
lymphoid cells. However, it is unknown whether innate myeloid cells retain memory of prior antigenic
stimulation and respond to it more vigorously on subsequent encounters. In this work, we show that
murine monocytes and macrophages acquire memory specific to major histocompatibility complex I
(MHC-I) antigens, and we identify A-type paired immunoglobulin-like receptors (PIR-As) as the MHC-I
receptors necessary for the memory response. We demonstrate that deleting PIR-A in the recipient or
blocking PIR-A binding to donor MHC-I molecules blocks memory and attenuates kidney and heart
allograft rejection. Thus, innate myeloid cells acquire alloantigen-specific memory that can be targeted
to improve transplant outcomes.


Immunological memory protects the host
against infection but is also a potent barrier
to transplant survival (1). Initially thought
to be confined to T and B lymphocytes, it
is now evident that innate myeloid and
lymphoid [natural killer (NK)] cells also ac-
quire features of immunological memory. In
lymphoid cells, memory responses are anam-
nestic (1-3). By contrast, myeloid cells exhibit
memory-like responses, referred to as trained 
immunity, as secondary stimuli that elicit 
enhanced reactions need not be related to 
the primary stimulus (4). Whether myeloid 
cells acquire memory specific to previously 
encountered antigens is not known but has 
become an important question, given the key 
roles of these cells in host defense (5). 
We studied monocyte and macrophage re-
actions to allogeneic bone marrow plug grafts
placed under the kidney capsule. Monocytes and 
macrophages sense major histocompatibility 
complex (MHC) and non-MHC determinants 
on allogeneic tissues and mature to antigen- 
presenting, inflammatory, or cytotoxic cells 
(6-9). To investigate memory, B6-Rag⁻/⁻Il2rg⁻/⁻
mice, which lack B, T, NK, and innate lymphoid 
cells, were immunized with irradiated alloge- 
neic splenocytes. They were then rechallenged 
with grafts from same-party (i.e., the same 
source as the immunizing splenocytes) or third- 
party allogeneic donors after 7, 28, or 49 days. 
B6-Rag⁻/⁻Il2rg⁻/⁻ (H-2b) mice immunized with
BALB/c (H-2d) splenocytes and rechallenged 
with same-party BALB/c allografts 7 or 28 days 
later exhibited significantly greater graft in- 
filtration with recipient monocyte-derived 
dendritic cells (Mo-DCs) than unimmunized 
recipients or those immunized with syngeneic 
(B6) or third-party allogeneic (C3H or H-2k) 
splenocytes (Fig. 1A). This enhanced response 
dissipated by 49 days after immunization. B6-
Rag⁻/⁻Il2rg⁻/⁻ recipients of C3H grafts mounted
a heightened response if previously immunized 
with C3H but not B6 or BALB/c splenocytes, 
which further demonstrates allospecificity 
(Fig. 1B). Enhanced responsiveness could not 
be attributed to immunogen persistence be- 
cause neither donor splenocytes nor intact 
donor MHC-I molecules were detected in the 
host’s circulation or spleen at the time of re- 
challenge (fig. SLA). Thus, host monocytes ap- 
pear to acquire specific memory to previously 
encountered alloantigens. 

To rule out any possible contribution of 
lymphoid cells in the graft, we repeated the 
experiments using Rag⁻/⁻Il2rg⁻/⁻ donors. B6-
Rag⁻/⁻Il2rg⁻/⁻ recipients responded more
vigorously to BALB/c-Rag⁻/⁻Il2rg⁻/⁻ bone mar-
row plug allografts if previously immunized
with BALB/c-Rag⁻/⁻Il2rg⁻/⁻ but not NOD-
Rag⁻/⁻Il2rg⁻/⁻ splenocytes (Fig. 1C), whereas
NOD-Rag⁻/⁻Il2rg⁻/⁻ immunization enhanced
the response to NOD-Rag⁻/⁻Il2rg⁻/⁻ allografts
(Fig. 1D). Thus, monocyte memory to allo- 
antigens is not dependent on lymphoid cells 
from either the graft or the recipient. 

The primary innate alloresponse that gen- 
erates Mo-DCs is mediated by inflammatory 
(Ly6Cʰⁱ) monocytes (6, 7). To test whether this
monocyte subset acquires memory, we sorted
Ly6Cʰⁱ monocytes from B6-Rag⁻/⁻Il2rg⁻/⁻
mice 7, 21, or 35 days after immunization and
transferred them to naive B6-Rag⁻/⁻Il2rg⁻/⁻
hosts (Fig. 1E). Mice that received monocytes 
from allostimulated donors mounted a sig- 
nificantly greater response to BALB/c allo- 
grafts than those that received monocytes 
from donors stimulated with syngeneic cells. 
This demonstrates that Ly6Cʰⁱ monocytes
are capable of mediating memory and that 
they retain memory function for up to sev- 
eral weeks after their first encounter with 
an alloantigen. 

Macrophages acquire the ability to kill allo- 
geneic targets if first challenged with alloge- 
neic cells along with CD40 receptor cross-linking 
(9). We therefore examined whether macro-
phages also acquire allospecific memory. B6-
Rag⁻/⁻Il2rg⁻/⁻ mice were left unstimulated (naive)
or were immunized with BALB/c splenocytes 
plus agonistic anti-CD40 antibody, and an in vivo 
killing assay was performed 14 and 28 days 
later (Fig. 1F). BALB/c-immunized mice killed 
BALB/c but not syngeneic (B6) or third-party 
(C3H) targets, which suggests that macrophages, 
like monocytes, acquire memory specific to pre- 
viously encountered alloantigens. 

MHC proteins are the principal alloanti- 
gens in humans and mice. To test whether 
monocyte or macrophage memory is depen-
dent on the detection of nonself MHC mole-
cules, we immunized B6-Rag⁻/⁻Il2rg⁻/⁻ (H-2b)
recipients with irradiated BALB/c (H-2d) spleno- 
cytes and rechallenged them with BALB.B (H-2b) 
grafts. BALB.B and B6 mice share the same MHC 
(H-2) but differ at non-MHC loci. Immuniza- 
tion with BALB/c cells did not enhance the 
response to BALB.B grafts, which indicates that 
monocyte memory is against nonself MHC mol- 
ecules (Fig. 1G). The same was true for macro- 
phages (Fig. 1H). Moreover, BALB/c-immunized 
hosts killed β2-microglobulin knockout targets
less efficiently than wild-type cells, which in- 
dicates that the response is primarily to non- 
self MHC-I molecules (Fig. 1H). Furthermore, 
macrophages from BALB/c-immunized B6 
mice bound allogeneic H-2Dd (and, to some 
extent, allogeneic H-2Kd) but not syngeneic 
H-2Db and H-2Kb MHC-I tetramers (Fig. 1I).
Additionally, host exposure to both MHC 
and non-MHC polymorphisms at the time of 
priming was necessary to generate memory
(fig. S1B). Polymorphism in the non-MHC
signal regulatory protein alpha (Sirpa) gene, 
which is known to trigger the primary mono- 
cyte alloresponse (8), was required (fig. S1C). 
Ig-like receptors (PIRs), which are orthologs of
human leukocyte Ig-like receptors (LILRs) (10). 
Closely linked genes coding for six PIR-A pro- 
teins and one PIR-B protein are located on mouse 
chromosome 7 (11). PIR-B is nonpolymorphic 
Fig. 2. Memory to allogeneic MHC-I is mediated by PIR-A molecules on
monocytes and macrophages. (A and B) PIR-A and -B expression on Ly6Cʰⁱ
monocytes (A) and macrophages (B) before [day 0 (d.0)] and after alloimmuniza-
tion (d.14 and d.28). Colored histograms show biological replicates (n = 3).
(C to E) Blocking PIR-A/B inhibits monocyte (n = 12; N = 2) [(C) and (D)] and
macrophage (n = 6; N = 2) (E) memory. B6-Rag⁻/⁻Il2rg⁻/⁻ hosts were immunized
with BALB/c splenocytes and rechallenged with BALB/c allografts 7 days later.
(F to H) H-2Dd MHC-I tetramers bind preferentially to PIR-A3-transfected BWZ.36
(n = 5; N = 1) (F) and 3T3 cells (G). PIR-A3/Fc binds preferentially to H-2Dd⁺ (BALB/c)
cells (H). β-gal, β-galactosidase. (I and J) Specific inhibition of macrophage
(n = 5 to 6; N = 2) (I) and monocyte (n = 6; N = 1 to 2) (J) memory to BALB/c
allografts by PIR-A3/Fc. Immun., immunization. (K) Absent memory 7 and
28 days after immunization, but normal primary monocyte alloresponse in Pira⁻/⁻
hosts (n = 6; N = 1 to 2). Statistical analyses: representative of three experiments
[(B), (G), and (H)]; mean and individual biological replicates [(C), (D), (F), and (I) to
(K)]; one-way ANOVA [(C) to (E), (I), and (J)]; or two-tailed Student's t test (K).

and binds a wide spectrum of MHC-I mole-
cules (12). Its cytosolic domain transmits in-
hibitory signals (12, 13), whereas PIR-As are
stimulatory (14, 15). PIR-A isoforms are be-
lieved to bind distinct MHC-I molecules (12),
and PIR-A expression is up-regulated upon
myeloid cell differentiation and activation
(15). Using anti-PIR-A/B and anti-PIR-B anti-
bodies, we detected PIR-A expression on rest-
ing Ly6Cʰⁱ monocytes (Fig. 2A). Macrophages
expressed both PIR-A and PIR-B and up-
regulated PIR-A after allostimulation (Fig.
2B). Blocking these receptors with an anti-
PIR-A/B antibody inhibited monocyte memory
in Pirb-sufficient and Pirb⁻/⁻ B6-Rag⁻/⁻Il2rg⁻/⁻
hosts (Fig. 2, C and D) and suppressed the
killing of allogeneic targets in alloimmunized
B6-Rag⁻/⁻Il2rg⁻/⁻ mice (Fig. 2E). We con-
firmed these results by treating animals with
PIR-A3/Fc fusion protein, which preferentially
blocks PIR-A3 from binding to its MHC-I lig- 
and, H-2Dd. The preferred ligand of PIR-A3 
is H-2Dd (Fig. 2, F to H). H-2Dd tetramers
triggered the strongest signal in reporter cells
expressing PIR-A3 (Fig. 2F). They bound PIR-
A3-transfected 3T3 fibroblasts better than
H-2Db tetramers (Fig. 2G), and the binding
of PIR-A3/Fc was greater to BALB/c (H-2Dd⁺)
than B6 cells (H-2Db⁺ but H-2Dd⁻) (Fig. 2H).
PIR-A3/Fc inhibited both macrophage (Fig. 2I)
Fig. 3. Mechanisms of monocyte memory. (A) Tetramer-positive splenic monocytes
7 days after immunizing B6-Rag⁻/⁻Il2rg⁻/⁻ mice (mean and individual biological
replicates; n = 3; N = 1 to 3). wt, wild-type. (B to E) Sp
analysis 1 and 4 weeks after immunization (n = 3; N = 1)


and monocyte (Fig. 2J) memory in BALB/c-
immunized B6-Rag⁻/⁻Il2rg⁻/⁻ mice. By con-
trast, PIR-A1/Fc did not inhibit the killing of
BALB/c (H-2d) targets (Fig. 2I). Furthermore,
PIR-A3/Fc failed to suppress the monocyte
memory response to C3H (H-2k) grafts (Fig.
2J), which confirmed the specificity of PIR-A3
to BALB/c H-2Dd molecules. Finally, we gen-
erated PIR-A locus knockout (Pira⁻/⁻) mice
lacking PIR-A1-5 (fig. S2), which were bred
onto the B6-Rag⁻/⁻Il2rg⁻/⁻ background. Notably,
these mice did not mount a monocyte memory 
response to BALB/c allografts 7 or 28 days after 
priming (Fig. 2K). By contrast, the primary 
response to BALB/c allografts (Fig. 2K) and pri- 
mary dendritic cell and macrophage functions 
(antigen presentation and IL-6 production) 
(fig. S3) were not inhibited in Pira⁻/⁻ recipients.
Thus, PIR-A mediates innate memory. 

We next investigated how monocyte mem- 
ory is acquired and whether it resembles NK 
memory (16). B6 mice immunized with BALB/c
(H-2d) cells selectively increased the propor- 
tion and number of splenic monocytes that 
bind the H-2Dd tetramer, whereas C3H (H-2k) 
immunization exclusively increased H-2Dk- 
binding cells (Fig. 3A and fig. S4A). This effect 
was preserved in Pirb⁻/⁻ mice but abolished in
Pira⁻/⁻ mice. Only a minority of monocytes
bound more than one type of tetramer (fig.
S4B), which suggests that PIR-A expression is 
variegated. Single-cell RNA sequencing (scRNA- 
seq) analysis of splenic monocytes performed 
1 and 4 weeks after immunization showed that
monocytes followed a pseudotime trajectory
from a starting state (S), which dominated in
mice stimulated with syngeneic cells, to ex-
panding states (E1 to E3), which increased
proportionally in allostimulated groups (Fig. 3, 
B and C). E states were enriched for cell cycle, 
antigen presentation, and immune pathways 
such as allograft rejection and graft-versus-host 
disease (data S1 and S2) and exhibited increased 
Pira and reduced Pirb expression (Fig. 3D).
Cell cycle pathway enrichment is consistent
with prior evidence of monocyte proliferation
after allostimulation (8). Finally, monocytes
that expressed mRNA of two different Pira
alleles constituted a small minority before and
after allostimulation (Fig. 3E), which provides
further evidence for variegated PIR-A expres- 
sion. Thus, a possible mechanism of monocyte 
memory is the clonal expansion of cells spe- 
cific to a particular nonself MHC molecule. 
Mo-DCs and macrophages contribute to al- 
lograft rejection in mice and humans (7, 17-19). 
However, it is unknown whether monocyte 
or macrophage memory mediated by PIR-A 
plays a role in this rejection. Kidney allografts 
survived long-term (>125 days) and main- 
tained normal serum creatinine in lymphoid 
cell-sufficient wild-type and Pira−/− recipients, 
but they were rapidly rejected by Pirb−/− mice 
(Fig. 4, A and B), consistent with the inhibitory 
functions of PIR-B (12, 13). PIR-A3/Fc prevented 
acute allograft rejection in Pirb−/− mice (Fig. 4, 
A and B), which suggests that PIR-A accelerates 
rejection in the absence of PIR-B. Chronic 
rejection and cellular infiltrates in wild-type 
mice were significantly attenuated in Pira−/− 
mice and in Pirb−/− recipients treated with 
PIR-A3/Fc (Fig. 4, C and D, and fig. S5A). As 
expected, chronic rejection did not occur in 
grafts that were rejected acutely [Pirb−/− and 
Pirb−/− plus control immunoglobulin G (Ctrl 
IgG) groups] (Fig. 4, C and D). We also observed 
a pathogenic role for PIR-A in heart transplan- 
tation. CTLA4-Ig or PIR-A3/Fc alone caused 
significant but modest prolongation of allo- 
graft survival, whereas their combined ad- 
ministration markedly extended survival and 
prevented pathology associated with acute or 
chronic rejection (Fig. 4, E and F, and fig. S5, 
B and C). Neither PIR-A deficiency nor block- 
ade suppressed alloantibody production (fig. 
S5, D and E). Thus, PIR-A, which is neces- 
sary for monocyte and macrophage memory, 

Fig. 4. Genetic deletion or blocking PIR-A attenuates kidney and heart allograft rejection. (A to D) Survival (A), serum creatinine (B), histology (C), and chronic rejection scores (D) of BALB/c kidneys trans[…] point to infiltrates and fibrosis. H&E, hematoxylin and eosin. (E and F) Survival (E) and histology (F) of BALB/c hearts transplanted to B6 mice treated with PIR-A3/Fc (+ CTLA4Ig) or control mouse IgG1 (Ctrl IgG). Arrows in (F) point to chronic rejection vasculopathy. VVG, Verhoeff-Van Gieson. Statistical analyses: mean and individual biological replicates [(B) and (D)]; 

promotes allograft rejection, whereas PIR-B 
tempers it. 

The findings described in this work identify 
a pathway of MHC-I allorecognition that is 
mediated by monocytes and macrophages, 
generates memory specific to previously en- 
countered MHC-I alloantigens, and contributes 
to allograft rejection. This pathway extends 
the domain of classical immunological mem- 
ory to innate myeloid cells and can be po- 
tentially targeted to improve the survival of 
life-saving organ transplants. 


REFERENCES AND NOTES 


1. A. Valujskikh, F. G. Lakkis, Immunol. Rev. 196, 65-74 
(2003). 

2. J. G. O'Leary, M. Goodarzi, D. L. Drayton, U. H. von Andrian, 
Nat. Immunol. 7, 507-516 (2006). 

3. J.C. Sun, J. N. Beilke, L. L. Lanier, Nature 457, 557-561 
(2009). 

4. M. G. Netea et al., Science 352, aaf1098 (2016). 


Dai et al., Science 368, 1122-1127 (2020) 5 June 2020 



5. K. Bassler, J. Schulte-Schrepping, S. Warnat-Herresthal, 
A.C. Aschenbrenner, J. L. Schultze, Annu. Rev. Immunol. 37, 
269-293 (2019). 
6. D. Zecher, N. van Rooijen, D. M. Rothstein, W. D. Shlomchik, 
F. G. Lakkis, J. Immunol. 183, 7810-7816 (2009). 
7. M.H. Oberbarnscheidt et al., J. Clin. Invest. 124, 3579-3589 (2014). 
8. H. Dai et al., Sci. Immunol. 2, eaam6202 (2017). 
9. W. Liu, X. Xiao, G. Demirci, J. Madsen, X. C. Li, J. Immunol. 188, 
2703-2711 (2012). 
10. T. Takai, Immunology 115, 433-440 (2005). 
11. H. Kubagawa, P. D. Burrows, M. D. Cooper, Proc. Natl. Acad. 
Sci. U.S.A. 94, 5261-5266 (1997). 
12. A. Nakamura, E. Kobayashi, T. Takai, Nat. Immunol. 5, 
623-629 (2004). 
13. G. Ma et al., Immunity 34, 385-395 (2011). 
14. A. Maeda, M. Kurosaki, T. Kurosaki, J. Exp. Med. 188, 991-995 
(1998). 
15. H. Kubagawa et al., J. Exp. Med. 189, 309-318 (1999). 
16. T. Nabekura, L. L. Lanier, J. Exp. Med. 211, 2455-2465 (2014). 
17. Q. Zhuang et al., Nat. Commun. 7, 12623 (2016). 
18. M. S. Braza et al., Immunity 49, 819-828.e6 (2018). 
19. I. Batal et al., J. Am. Soc. Nephrol. 26, 3102-3113 (2015). 


ACKNOWLEDGMENTS 


We would like to thank C. Bi and Z. Kou of the Transgenic and 
Gene Targeting Core (Department of Immunology, University of 



Pittsburgh School of Medicine) for microinjection of zygotes and 
production of Pira- and Pirb-mutant mice. Funding: This work was 
supported by NIH grants AI145881 (M.H.O.), AI080779 (X.C.L.), 
and AI099465 (F.G.L.) and by funds from the Frank and Athena 
Sarris Chair in Transplantation Biology at the University of 
Pittsburgh (F.G.L.). J.S.D. was supported by the JDRF, the 


Canadian Institutes of Health Research, and the SickKids 
Foundation. Author contributions: H.D., P.L., D.Z., W.L., Wen.C., 
A.J.F., T.L., L.R.M., and H.R.T. performed experiments and analyzed 
data. K.A.-D., T.S., and J.C. performed bioinformatic analysis. 

Wei C. supervised bioinformatic analysis. A.L.W. and S.M.-T. 
performed mouse breeding and genotyping. J.S.D. supervised 
generation and genotyping of NOD congenic mice and edited the 
manuscript. C.W. and P.N. provided data and insights into human 
correlates of mouse findings. M.L.N. performed comparative 
mouse genomics and edited the manuscript. S.G. designed and 
constructed gene knockout mice and contributed to the 
manuscript writing. E.T. and H.K. designed, constructed, and 
provided expression vectors and reagents and edited the 
manuscript. M.J.S. supervised design and construction of Pirb−/− 
mice, provided the mice, and edited the manuscript. M.H.O. 
participated in experimental design, coordinated experiments, 
trained and supervised personnel, and wrote the manuscript. 
M.H.O., X.C.L., and F.G.L. conceived the idea, designed 
experiments, analyzed data, supervised overall work, and 




wrote the manuscript. Competing interests: The Hospital for 
Sick Children receives royalties from Trillium Therapeutics, Inc. 
for SIRPα-Fc, of which J.S.D. receives a fraction. J.S.D. is a 
coinventor on a patent related to this work, “Role of SIRP alpha 
polymorphisms in human acute myeloid leukemia.” PCT 
applications are pending in the United States, Canada, and 
China and have been awarded in Australia (2015) and Japan 
(2018). Data and materials availability: Raw scRNA-seq data 
have been deposited into the Gene Expression Omnibus (GEO) 




database (accession no. GSE147596). All other data needed to 
evaluate the conclusions of the paper are available in the 
manuscript or the supplementary materials. 


SUPPLEMENTARY MATERIALS 
science.sciencemag.org/content/368/6495/1122/suppl/DC1 
Materials and Methods 

Figs. S1 to S5 

Tables S1 to S4 


References (20-28) 
Data S1 and S2 




20 March 2019; resubmitted 2 December 2019 
Accepted 10 April 2020 

Published online 7 May 2020 
10.1126/science.aax4040 




RESEARCH 


MTOR signaling orchestrates stress-induced 
mutagenesis, facilitating adaptive evolution in cancer 
David R. Croucher, Alvaro Gonzalez Rajal, Simon R. Junankar, Darren N. Saunders, 
Pavel Lobachevsky, Anthony T. Papenfuss, Danielle Nessem, Max Nobis, Sean C. Warren, 

In microorganisms, evolutionarily conserved mechanisms facilitate adaptation to harsh conditions through 
stress-induced mutagenesis (SIM). Analogous processes may underpin progression and therapeutic failure in 
human cancer. We describe SIM in multiple in vitro and in vivo models of human cancers under nongenotoxic 
drug selection, paradoxically enhancing adaptation at a competing intrinsic fitness cost. A genome-wide 
approach identified the mechanistic target of rapamycin (MTOR) as a stress-sensing rheostat mediating SIM 
across multiple cancer types and conditions. These observations are consistent with a two-phase model for 
drug resistance, in which an initially rapid expansion of genetic diversity is counterbalanced by an intrinsic 


fitness penalty, subsequently normalizing to complete adaptation under the new conditions. This model 
suggests synthetic lethal strategies to minimize resistance to anticancer therapy. 


Genetic diversity is central to adaptation 
and evolution in cancer. Mutagenesis in 
malignancies occurs incrementally (1, 2) 
or in transient bursts (3-7), providing 
increased cell-to-cell variation that facilitates 
adaptation to selective pressures, including 
anticancer therapies. Paradoxically, mutagenesis 
induced by some therapies, such as ionizing 
radiation, forms a core component of treatment 
for most cancers. The evolutionary effects of 
mutagenesis appear to be context dependent; 
they are potentially harmful under favorable 
conditions yet beneficial under stressed con- 
ditions. Stress-induced mutagenesis (SIM) 
programs that link environmental conditions 
to genetic diversity are a common atavistic 
theme across the phylogenetic tree (8) and 
are well described in prokaryotes (9) and 
lower eukaryotes (10-13). 

Here, we sought evidence for cognate programs 
in cancer. We first looked for accumulation 
of DNA damage (14) in human cancers 
exposed to therapies not considered directly 
genotoxic. Phosphorylated histone H2AX 
(γ-H2AX) was used as a marker of DNA double-strand 
breaks (DSBs) in pre- and posttreatment 
samples taken from diverse cohorts of chemo- 
and radiotherapy-naive cancer patients. In- 
creased DNA damage was consistently observed 


in prostate cancer patients treated with andro- 
gen deprivation therapy, in breast cancer pa- 
tients treated with the aromatase inhibitor 
letrozole, in melanoma patients treated with 
the BRAF inhibitors dabrafenib or vemur- 
afenib, and in patients with gastrointestinal 
stromal tumors treated with the KIT inhibi- 
tor imatinib (Fig. 1A and fig. S1). DNA DSBs 
were also increased in patient-derived xeno- 
graft (PDX) pancreatic cancer models treated 
with the CDK4/6 inhibitor palbociclib and 
the epidermal growth factor receptor (EGFR) 
inhibitor erlotinib, suggesting that increased 
DNA damage is a recurrent feature in human 
cancers exposed to nongenotoxic therapies. 

We interrogated this phenomenon further 
in vitro using multiple human cancer cell lines 
exposed to nongenotoxic drug selection. Drugs 
specific to genetic targets in each cell line were 
titrated to near-extinction conditions, consist- 
ently generating one to 20 resistant colonies 
per 100,000 cells (Fig. 1B and table S1), which 
were analyzed for genetic and molecular fea- 
tures, as well as drug sensitivity (fig. S2). All 
model systems displayed increased DSBs early 
during evolution (Fig. 1C), which decreased to 
baseline during subsequent culture (fig. S3). 
The levels of DSBs were stable in the absence 
of selection. 
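
As a worked illustration of the colony yield quoted above, the following Python sketch converts a colony count into a per-cell frequency. The function name `resistance_frequency` is hypothetical and is not taken from the study's methods; only the range of one to 20 colonies per 100,000 plated cells comes from the text.

```python
def resistance_frequency(colonies: int, cells_plated: int) -> float:
    """Return the fraction of plated cells that yielded a resistant colony."""
    if cells_plated <= 0:
        raise ValueError("cells_plated must be positive")
    if colonies < 0:
        raise ValueError("colonies must be non-negative")
    return colonies / cells_plated


# Range reported in the text: one to 20 colonies per 100,000 cells.
low = resistance_frequency(1, 100_000)
high = resistance_frequency(20, 100_000)
print(f"Resistant colony frequency: {low:.0e} to {high:.0e} per plated cell")
```

Under these figures, near-extinction selection corresponds to roughly one survivor per 5,000 to 100,000 plated cells.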


To further characterize this phenomenon, we 
undertook whole-genome sequencing (average 
read depth 116x) on single cell-derived clonal 
populations obtained from untreated and early-phase, 
drug-resistant 94T778 human liposarcoma 
and SKMEL28 human melanoma lines. Phyloge- 
netic trees were computed on the basis of copy 
number changes, revealing in both lines distinct 
clusters of untreated and drug-resistant clones 
sharing a common trunk. The observed lengths 
of the trunks (0.414 and 0.539 arbitrary units for 
94T778 and SKMEL28 clones, respectively) were 
significantly longer than the average branch 
lengths of the respective parental clones (0.118 
in 94T778, P = 1.8e-27, and 0.209 in SKMEL28, 
P = 1.4e-6), consistent with accelerated genomic 
evolution induced by the therapeutic 
bottleneck (Fig. 1D and figs. S4 and S5). To 
formally estimate mutation rates, we expanded 
clones using an equal number of generations 
(20 for 94T778; 21 for SKMEL28). Taking into 
account differential senescence and cell death 
(figs. S6 and S7, tables S2 and S3, and materials 
and methods), we quantified the percentage of 
de novo single nucleotide variants (SNVs) as 
the number of subclonal SNVs divided by 
the number of total unique SNVs. Intraclonal 
diversity, as measured by the number of de novo 
SNVs, was significantly higher in the resistant 
populations in both cell lines (Fig. 1E), consistent 
with higher mutation rates. We sought evi- 
dence for resistance-specific mutational signa- 
tures but observed no consistent patterns (fig. 
S8). We also performed targeted sequencing 
(figs. S9 to S11) on bulk populations, which 
revealed transiently increased structural and 
single nucleotide variation early during drug 
selection. 
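
The de novo SNV metric defined in this paragraph (subclonal SNVs divided by total unique SNVs) can be sketched in Python as follows. This is a minimal illustration only: the function name and the counts in the example are hypothetical placeholders, not values from the study, and the corrections for differential senescence and cell death described in the materials and methods are not modeled.

```python
def de_novo_snv_percentage(subclonal_snvs: int, total_unique_snvs: int) -> float:
    """Percentage of unique SNVs in a clonal population that are subclonal.

    Subclonal SNVs are taken here as a proxy for variants that arose
    during expansion of the clone, following the definition in the text.
    """
    if total_unique_snvs <= 0:
        raise ValueError("total_unique_snvs must be positive")
    if not 0 <= subclonal_snvs <= total_unique_snvs:
        raise ValueError("subclonal_snvs must lie between 0 and total_unique_snvs")
    return 100.0 * subclonal_snvs / total_unique_snvs


# Hypothetical counts, chosen only to show the calculation.
parental = de_novo_snv_percentage(subclonal_snvs=30, total_unique_snvs=1000)
resistant = de_novo_snv_percentage(subclonal_snvs=120, total_unique_snvs=1000)
print(f"parental: {parental:.1f}%  resistant: {resistant:.1f}%")
```

Because both populations were expanded for an equal number of generations, a higher percentage in the resistant clones is read as a higher mutation rate.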

To further characterize the dynamics of ge- 
nomic instability, the 94T778 cell line, con- 
taining two amplified targetable oncogenes 
(CDK4 and MDM2), was exposed to palboci- 
clib or to nutlin-3a, an inhibitor of the p53- 
MDM2 interaction. Cells were transduced at 
both early and late time points of evolution 
with a single-copy fluorescent reporter gene 
(mCherry). In this system, genomic alterations 
lead to loss of gene expression. In agreement 
with the targeted sequencing data, drug- 
resistant cell populations assayed early during 
evolution demonstrated a significant loss of 
fluorescence compared with baseline or late 
time points (fig. S12). To approximate the 
relative contribution of de novo versus preexisting 
genetic variation to drug resistance, 
clonal population dynamics were tracked by 
engineering 94T778 and the human melanoma 
A375 cell line with a high-complexity DNA 
barcode library (15). Although we observed 
selection of preexisting genetic variation, the 
high frequency of individual barcodes (94T778 
line average 69.6%, range 42.9 to 93.3%; A375 
line average 17.9%, range 3 to 64.7%) is 
consistent with a contribution of de novo 
mutagenesis to resistance (fig. S13). This is also 
suggested by the late (87 weeks, T3) emergence 


of subclonal TP53 mutations (C277F, R273L) 
induced by nutlin-3a, neither of which is visible 
at weeks 11 (T1) or 21 (T2) (fig. S11). 

As noted, mutagenesis may directly entail a 
fitness penalty driven by the accumulation of 
deleterious mutations. To study the counter- 
selective effects of SIM, we examined the 
clonogenic potential of both parental and 
resistant lines in the absence of drug selec- 
tion. We observed universally decreased clono- 
genicity relative to drug-naive controls (Fig. 1F). 
Consistent with these effects, colonies obtained 
from nutlin-3a-resistant 94T778 contained up 


to 45% senescent cells at 6 weeks, falling to 6% 
at 25 weeks, compared with 1.5% in untreated 
controls (fig. S14). Extinction events may ex- 
plain the loss of some initially resistant clones; 
for example, the C238F resistance mutation in 
TP53 was detected at 11 weeks under nutlin-3a 
selection but was lost thereafter (fig. S11). These 
data support a SIM-induced intrinsic fitness 
penalty early during adaptation. 

To identify common genetic pathways me- 
diating SIM in human cancer, we conducted 
a genome-wide functional screen using the 
94T778 cell line. The presence of multiple 



Fig. 1. Selection with targeted anticancer therapies results in genetic diversity. (A) Assessment of nuclear DNA DSBs in clinical samples and in PDX pancreatic cancer models. Shown is the fold change (mean ± SD) relative to pretreatment samples in patients and vehicle-treated controls in PDX models. Blue and black symbols represent negative and positive changes, respectively. Black diamonds indicate the overall fold change (FC) with 95% confidence intervals (95% CIs). (B) Schematic illustrating the time spans of the selection and evolution phases of in vitro model systems. Shaded area represents total population size, and lines represent the fate of individual clonal lineages after bottleneck. Dashed lines indicate extinction events; solid lines indicate evolution and clonal expansion. (C) Fold change values of DNA DSBs relative to untreated parental cells (mean ± SD). (D) Phylogenetic analysis of CNVs observed in single cell-derived clonal populations obtained from early-phase, tunicamycin-resistant 94T778 and vemurafenib-resistant SKMEL28 lines. The values below each set of parental (P) and drug-resistant (R) clones represent the average branch length. Values below the common trunks (orange) indicate their length. P values were obtained from Z tests. (E) Quantification of de novo SNVs generated during the expansion of the clonal populations. P values were obtained from quasibinomial logistic regression. (F) Colony formation assays performed in the absence of drug selection revealing the impact of the adaptive response on cellular fitness during the early phase of evolution. P values were obtained from two-tailed t test relative to drug-naive parental cells. 

critical drug targets in these cells allowed us 
to identify shared genetic determinants of 
resistance. Cells were engineered with a whole- 
genome short hairpin RNA (shRNA) library 
and subjected to five different selective con- 
ditions (Fig. 2A): (i) culture for 2 weeks with- 
out selection to estimate selective drift; (ii) 
equivalently titrated near-extinction selective 
conditions using nutlin-3a; (iii) palbociclib; 
(iv) tunicamycin targeting the unfolded pro- 
tein response; and (v) incrementally increas- 
ing concentrations of nutlin-3a (to determine 
the relevance of selective stringency). 
Initially, the screen identified condition- 
specific enrichment of shRNAs for genes known 
to confer resistance, as well as previously un- 
known genes, in these important pathways 
(“solution” genes). For examples of known 


resistance genes, shRNAs targeting TP53 and 
RB1 were among the top-ranked genes for the 
nutlin-3a and palbociclib conditions, respec- 
tively (table S4 and fig. S15). This provided 
evidence that the assay was performing as 
expected. We next looked for genes with 
shRNAs that were enriched across all near- 
extinction conditions (“facilitator” genes). Fa- 
cilitator genes were associated with lower 
levels of enrichment compared with solution 
genes (table S5), which was perhaps due to an 
indirect role in mediating resistance through 
subsequent stochastic events, or possibly to 
counterselection. Top-ranked among facilitators 
were the genes encoding the Chaperonin 
Containing TCP1 Subunit 3 (CCT3) and the 
RNA-processing factor Heterogeneous Nuclear 
Ribonucleoprotein L (HNRNPL) (16). CCT3 has 
an important role in telomere maintenance 
(17, 18) and genome stability (19), and hnRNP 
protein-mediated alternative splicing has a role 
in cancer progression (20). The next most en- 
riched gene was that coding for the mechanistic 
target of rapamycin (MTOR) (table S6 and fig. 
S16). In addition, the analysis revealed the en- 
richment for shRNAs directed to positive reg- 
ulators of MTOR and the depletion of hairpins 
targeting the negative regulator PTEN (Fig. 2B). 
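
The distinction drawn above between "solution" genes (enriched under one selective condition) and "facilitator" genes (enriched across all near-extinction conditions) can be sketched as a set operation. This is a simplified illustration under stated assumptions: the function `classify_hits` is hypothetical, the hit lists contain only the genes named in the text, and the actual screen ranked genes by enrichment statistics (q values) rather than by set membership.

```python
from collections import Counter


def classify_hits(enriched: dict[str, set[str]]) -> tuple[set[str], set[str]]:
    """Split enriched genes into (solution, facilitator) sets.

    Assumption for this sketch: a facilitator gene is enriched in every
    condition supplied, and a solution gene in exactly one of them.
    """
    if not enriched:
        return set(), set()
    # Count in how many conditions each gene appears.
    counts = Counter(gene for genes in enriched.values() for gene in genes)
    n_conditions = len(enriched)
    facilitator = {g for g, n in counts.items() if n == n_conditions}
    solution = {g for g, n in counts.items() if n == 1} - facilitator
    return solution, facilitator


# Illustrative hit lists built from genes named in the text (not the full screen output).
hits = {
    "nutlin-3a": {"TP53", "MTOR", "CCT3", "HNRNPL"},
    "palbociclib": {"RB1", "MTOR", "CCT3", "HNRNPL"},
    "tunicamycin": {"MTOR", "CCT3", "HNRNPL"},
}
solution, facilitator = classify_hits(hits)
print("solution:", sorted(solution))
print("facilitator:", sorted(facilitator))
```

A gene enriched in some but not all conditions falls into neither set here, which is one reason a ranked, statistical approach is preferable for real screen data.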

The MTOR signaling pathway is widely ex- 
pressed in human tissues and functions as an 
evolutionarily conserved sensor of environ- 
mental and endogenous stress (21-23). The 
94T778, SKMEL28, and human breast cancer 
SKBR3 cell lines were engineered with MTOR- 
directed shRNAs and were exposed to tunica- 
mycin, vemurafenib, and the EGFR/HER2 

Fig. 2. Whole-genome RNAi screen identifies MTOR as a common evolutionary capacitor in cancer. (A) Schematic diagram of the genome-wide RNAi screen. (B) Enrichment (left panel) and depletion (right panel) of hairpins targeting components of MTOR signaling. Genes are ranked by q values, with genes in red and light red being highly significant and approaching significance, respectively. The dashed line indicates the 0.05 statistical significance threshold. (C) Silencing of MTOR signaling imposes a fitness penalty in normal culture conditions (white bars) but fosters adaptability in response to pharmacologic pressures (gray bars). P values were obtained from two-tailed t test versus nonsilencing (NS) controls. (D) Silencing of MTOR affects therapeutic responsiveness to palbociclib in PDX pancreatic cancer models. P values were obtained from two-tailed t test versus first time point after the end of the second treatment cycle. 

inhibitor lapatinib, respectively. All cell lines 
showed a reduction in MTOR protein levels 
and a reduced phosphorylation of the down- 
stream kinase p70-S6K (fig. S17A). In addi- 
tion, we observed a transcriptional repression 
of genes positively regulated by MTOR and 
an increased expression of genes repressed by 
MTOR (24) (fig. S17B). As expected, MTOR 
silencing impaired clonogenicity in the absence 
of selection, but paradoxically increased clono- 
genicity under drug selection (Fig. 2C and 
supplementary materials), recapitulating the 
models described earlier. This effect was not 
caused by altered drug sensitivity in short-term 
assays (fig. S18). To exclude off-target effects 
from gene knock-down, these experiments 
were validated using shRNAs targeting the 
3' untranslated region of MTOR. The stress- 
induced adaptive potential was abrogated 
using an RNA interference (RNAi)-resistant 
open reading frame of the gene (fig. S19). In vivo, 
repression of MTOR accelerated emergence 
of palbociclib resistance in a PDX pancreatic 
cancer model (Fig. 2D) despite enhancing the 
effect of palbociclib in short-term ex vivo assays 
(fig. S20). Consistent with previously described 
kinetics of mutagenesis and impaired intrinsic 
fitness, a time course analysis of MTOR and 
p70-S6K phosphorylation levels revealed a sig- 
nificant reduction early during evolution and 
a restoration to pretreatment levels late during 


Because inhibition of MTOR impairs the 
DNA-damage response (25, 26), we examined 
DNA DSBs in 94T778 cells engineered with 
MTOR-specific shRNAs. The analysis of con- 
focal microscopy image data revealed elevated 
levels of DSBs comparable to those seen in cells 
spontaneously resistant to nutlin-3a (fig. S22A). 
Furthermore, the selective MTOR inhibitor 
torkinib (pp242) delayed the repair of ioniz- 
ing radiation-induced DNA DSBs, assessed 
as the number of γ-H2AX (fig. S22B) and 53BP1 
(fig. S22C) foci. To quantitate the effects of 
MTOR silencing on mutagenesis, we under- 
took whole-genome sequencing on single cell- 
derived clonal populations obtained from 
94T778 cells expressing nonsilencing or MTOR- 
specific shRNAs. We found a significant increase 
in intraclonal diversity in MTOR-silenced cells 
(figs. S23 to S25, tables S2 and S3, and materials 
and methods). No MTOR-specific mutational 
signatures were evident in 94T778 clonal 
populations (fig. S8). In our lines, pharmaco- 
logic inhibition of MTOR repressed expres- 
sion of the homologous recombination (HR) 
and Fanconi anemia pathways and selectively 
repressed high-fidelity, but not error-prone 
DNA polymerases (Fig. 3A and table S7). 

To ascertain whether similar transcriptional 
changes are elicited across a broader range 
of cancer types, we interrogated the LINCS 
L1000 database, a transcriptome dataset of 


tiple doses and time points (27). Almost iden- 
tical patterns were seen in multiple cell lines 
exposed to three different MTOR inhibitors 
(fig. S26), as well as in 94T778 cells develop- 
ing spontaneous drug resistance, as described 
earlier (Fig. 3B and table S8). In the LINCS 
L1000 database, multiple cytostatic drugs sim- 
ilarly repress accurate DNA repair (fig. S27). 
This suggests that inhibiting cell proliferation 
may affect genetic diversity through coor- 
dinated regulation of multiple components of 
DNA repair. To test whether altered DNA re- 
pair would enhance adaptation, we reinterro- 
gated the genome-wide screen and found 
specific enrichment for shRNAs targeting HR 
genes (table S9). To determine whether the 
repression of HR directly facilitates drug re- 
sistance, we engineered 94T778, SKBR3, and 
SKMEL28 cells with shRNAs targeting BRCA1, 
BRCA2, and PALB2 and subjected the cells to 
selection. Recapitulating both spontaneous drug 
resistance models and silencing of MTOR, im- 
pairment of HR reduced clonogenicity in the 
absence of selection and increased clonoge- 
nicity under pharmacologic pressure (Fig. 3C 
and supplementary materials). The induction 
of a defective DNA-damage response by non- 
genotoxic cytostatic therapies suggests synthetic 
lethality-based strategies. In a preliminary proof- 
of-concept experiment, we assessed the impact 
of palbociclib combined with a poly(ADP-ribose) 

Fig. 3. Inhibition of MTOR fosters adaptive mutagenesis by repressing accurate DNA repair. (A) Transcriptional changes induced by torkinib (pp242). P values were obtained from one-sided Wilcoxon rank-sum test and are shown in table S7. (B) Transcriptional changes observed during the early phase of the adaptive evolution in 94T778 cells exposed to palbociclib (P), nutlin-3a (N), and tunicamycin (T) sampled at 10 weeks for P and N and at 9 weeks after bottleneck for the T condition. P values were obtained from one-sided Wilcoxon rank-sum test and are shown in table S8. (C) Effect of silencing of HR genes in the absence (white bars) or presence of pharmacologic pressures (gray bars). P values were obtained from two-tailed t test versus NS control shRNA. (D) Effect of palbociclib and rucaparib combination therapy on tumor growth inhibition (bar plot, 30 days) and […] plot) in a PDX pancreatic cancer model. P values for tumor growth were obtained from two-tailed t test; a log-rank test on palbociclib + rucaparib versus palbociclib monotherapy was used in the Kaplan-Meier plot. 

Fig. 4. Conserved mechanisms underpinning SIM in human cancer. (A) Selection by cytostatic anticancer therapies leads to increased levels of DNA damage and impaired mTOR signaling, providing genetic diversity and accelerating adaptation to anticancer treatments. (B) During the initial phase of adaptation, the fitness landscape is a dynamic balance between the deleterious effect of intrinsic selection and the beneficial effect conferred by resistant genomic configurations. Fit genotypes are progressively stabilized during the second equilibrium phase. 


pancreatic model. Combination therapy enhanced 
the antitumor effects compared with 
either agent alone (Fig. 3D). 

Collectively, these data reveal SIM as a 
common mechanism facilitating the evolution 
of human cancer and are broadly in agree- 
ment with recent observations in colorectal 
cancer (28). As with error-prone and faithful 
DNA polymerases, the advantage conferred 
by SIM in a metazoan context is not clear 
and may simply be an evolutionary remnant. 
MTOR-mediated SIM slows replication and 
fosters genomic instability by impairing accu- 
rate DNA repair, thereby enhancing genetic 
diversity and facilitating resistance to ther- 
apy (29) (Fig. 4A). Both suppression of MTOR 
signaling and DNA repair also directly confer 
an intrinsic fitness penalty by inhibition of 
growth signaling and by increasing the ac- 
cumulation of deleterious mutations, respec- 
tively. Rather than being a simple process, our 
data suggest that drug resistance may be the 
compound result of two independent but re- 
lated mechanisms. Initially, net fitness balances 
the intrinsic fitness penalty of MTOR-mediated 
SIM with the generation of resistant genotypes 
undergoing extreme selection. Subsequently, 
normalization of SIM and fixation of stably 
resistant genomic configurations establish a 
new adaptive equilibrium (Fig. 4B). From a clinical 
perspective, our findings may explain the 
observation that agents targeting MTOR or 
the upstream phosphoinositide 3-kinase path- 
way have greater effects on objective responses 
or progression-free survival than on overall sur- 
vival (30). From a drug development perspective, 
enhanced adaptation leading to drug resistance 
is undetectable by most preclinical assays of 
anticancer activity or by short-term measures 
of objective responses in trials. Our findings 
also provide a rational framework for synthetic 
lethal combinations of cytostatic agents with 
genotoxic therapies. Such combinations could 
potentially generate a lethal mutational load 
during the initial phase of adaptive evolution, 
thereby reducing therapeutic failure. 

NELL2-mediated lumicrine signaling through OVCH2 
is required for male fertility 


Daiji Kiyozumi, Taichi Noda, Ryo Yamaguchi, Tomohiro Tobita, Takafumi Matsumura, 
Kentaro Shimada, Mayo Kodani, Takashi Kohda, Yoshitaka Fujihara, Manabu Ozawa, 
Zhifeng Yu, Gabriella Miklossy, Kurt M. Bohren, Masato Horie, Masaru Okabe, 
Martin M. Matzuk, Masahito Ikawa 


The lumicrine system is a postulated signaling system in which testis-derived (upstream) secreted 
factors enter the male reproductive tract to regulate epididymal (downstream) pathways required for 
sperm maturation. Until now, no lumicrine factors have been identified. We demonstrate that a testicular 
germ-cell-secreted epidermal growth factor-like protein, neural epidermal growth factor-like-like 2 
(NELL2), specifically binds to an orphan receptor tyrosine kinase, c-ros oncogene 1 (ROS1), and mediates 
the differentiation of the initial segment (IS) of the caput epididymis. Male mice in which Nell2 had 
been knocked out were infertile. The IS-specific secreted proteases, ovochymase 2 (OVCH2) and
A disintegrin and metallopeptidase 28 (ADAM28), were expressed upon IS maturation, and OVCH2 was
required for processing of the sperm surface protein ADAM3, which is required for sperm fertilizing 
ability. This work identifies a lumicrine system essential for testis-epididymis-spermatozoa (NELL2-ROS1- 
OVCH2-ADAM3) signaling and male fertility. 


Sex hormones, especially androgens, play critical roles in male genital tract
development and function. In addition to androgens, it is hypothesized that
proteins and/or small molecules produced in the mammalian testis (upstream) are
secreted into the luminal space and are transported to the epididymis to
regulate, maintain, and/or differentiate the epididymis (downstream) (1).
However, the molecular entities of this lumicrine system have not been
identified. An orphan receptor tyrosine kinase—c-ros oncogene 1 (ROS1), which
is well known as oncogenic when its gene is rearranged to


Fig. 1. Testicular NELL2 is indispensable for
ROS1-mediated differentiation of epididymal IS.
(A) In vitro binding of ROS1 with NELL2 and
several other indicated proteins. (B) Reverse
transcription polymerase chain reaction analyses
of Nell2 expression in adult organs and in
postnatal testis. Hprt is also shown as an internal
control. (C) Litter sizes of Nell2 KO mice.
Average and SD are shown. (D and E) Hematoxylin
and eosin (H&E) staining of caput epididymis
of 14-week-old (D) WT and (E) Nell2 KO mice.
Scale bar, 50 μm. (F to H) Phospho-ERK
immunoblot (F) and immunofluorescence
analyses of (G) WT and (H) Nell2 KO
14-week-old caput epididymis. Scale bar,
50 μm. (I) Expression of transcription factors
downstream of ERK signaling in 14-week-old
WT and Nell2 KO caput epididymis. RPKM,
reads per kilobase per million.

form active fusion proteins—is expressed in
the initial segment (IS) of the caput epididymis 
and is indispensable for postnatal IS differen- 
tiation (2). Neither the testicular factors that 
regulate IS differentiation nor the mechanisms 
by which differentiated IS matures spermato-
zoa has been fully clarified. In this study, we
identified a testis-secreted factor that binds to 
ROS1 and is indispensable for IS differentiation 
and male fertility. 

The dependency of IS on testicular lumi- 
crine factors is reported to be established be- 
tween postnatal day 15 (P15) and P19 (3). We 
found that germ cell-deficient Kit^W/W^v (W/W^v)
mice also show impaired IS differentiation (fig.
S1). These observations suggest that the puta-
tive ROS1 ligand is expressed by developing
testicular germ cells (TGCs). To identify the
testicular ligand for epididymal orphan re-
ceptor ROS1, we screened for testis-secreted
proteins that are up-regulated during sper-
matogenesis (fig. S2). First, from GSE640
Microarray data (36,939 spots), we selected 
1706 spots as matrisome genes in silico (4). 
Second, we chose nine candidate proteins that 
satisfied the criteria that we set (proteins with 
N-terminal signal sequence, no transmem- 
brane domain, and average transcript levels 
more than three times higher after P18 com- 
pared with those before P14). We excluded 
seven genes for which knockout (KO) mice
lacking the encoded proteins were fertile (Comp,
Vit, Zp3r, C1qtnf4, and Agt) (5-9) or showed
different phenotypes from Ros1 KO mice
(Insl6 and Prss21) (10, 11). Because PRSS39 is a
protease and less likely to be a ROS1 ligand, 
we focused on neural epidermal growth factor- 
like-like 2 (NELL2), which satisfied the criteria 
above and could be a hypothetical lumicrine 
factor and regulator of male fertility. 
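
The three selection criteria described above (N-terminal signal sequence, no transmembrane domain, and average transcript levels more than three times higher after P18 than before P14) can be sketched as a simple filter. This is only an illustration of the stated logic, not the authors' pipeline: the `Candidate` record, the `passes_screen` function, and all gene names and expression values are hypothetical and are not taken from the GSE640 data.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class Candidate:
    gene: str
    has_signal_peptide: bool        # N-terminal signal sequence predicted
    has_transmembrane_domain: bool  # any transmembrane domain predicted
    expr_before_p14: list           # transcript levels at time points before P14
    expr_after_p18: list            # transcript levels at time points after P18


def passes_screen(c: Candidate, min_fold: float = 3.0) -> bool:
    """Apply the three stated criteria to one candidate protein."""
    if not c.has_signal_peptide or c.has_transmembrane_domain:
        return False
    # "More than three times higher" is read as a strict inequality on averages.
    return mean(c.expr_after_p18) > min_fold * mean(c.expr_before_p14)


# Hypothetical records for illustration only.
candidates = [
    Candidate("GeneA", True, False, [10.0, 12.0], [40.0, 50.0]),   # passes
    Candidate("GeneB", True, True, [10.0, 10.0], [90.0, 95.0]),    # transmembrane
    Candidate("GeneC", True, False, [10.0, 10.0], [20.0, 30.0]),   # only 2.5-fold
    Candidate("GeneD", False, False, [5.0, 5.0], [60.0, 70.0]),    # no signal peptide
]

selected = [c.gene for c in candidates if passes_screen(c)]
print(selected)
```

In practice the signal-sequence and transmembrane flags would come from sequence-prediction tools, and the fold-change threshold would be applied per microarray spot; both steps are assumed here rather than shown.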

Recombinant NELL2 protein specifically
bound recombinant ROS1 in vitro, whereas 
recombinant cartilage oligomeric matrix pro- 
tein (COMP), which carries epidermal growth 
factor-like domains, did not (Fig. 1A and fig. S2). 
In wild-type (WT) mice, Nell2 is expressed in 
testis and brain but not in epididymis (Fig. 1B 
and fig. S1). In W/W^v mice, Nell2 is expressed in
brain, but the IS does not differentiate (fig. S1). 
Therefore, we focused on testis as the source of 
NELL2 and identified spermatocytes as Nell2- 
expressing cells with single-cell RNA-sequencing 
(RNA-seq) data (fig. S1) (12), which is consistent
with increased Nell2 transcript levels from P14
(fig. S2). Because spermatocytes are located
inside the blood-testis barrier in seminiferous 
tubules, these data suggest that NELL2 is se- 
creted from TGCs, transits through the lumi- 
nal space of seminiferous tubules, and binds to 
ROS1 on the epididymal epithelial cells.

To investigate NELL2 functions in vivo, we 
generated mice in which Nell2 had been 
knocked out (Nell2 KO mice) using CRISPR- 
Cas9 (fig. S3) and found that Nell2 KO males 
are sterile (Fig. 1C). A Clgn-Nell2 transgene, 
which expresses Nell2 in a TGC-specific man- 
ner (13), completely rescued the male infer- 
tility of the Nell2 KO mice, again excluding a 
possibility that NELL2 expressed in the brain 
regulates fertility (fig. S4). We detected testis-
secreted His-tagged NELL2 in the luminal con-
tents of the epididymal duct, supporting the idea that
NELL2 is a lumicrine factor (fig. S4).

To determine the cause of infertility in Nell2 
KO males, we examined their reproductive 
tracts. Spermatogenesis did not appear to be 
affected in the Nell2 KO testis (fig. S3). By 
contrast, the IS was poorly differentiated in 
Nell2 KO males (Fig. 1, D and E, and figs. S3 
and S5), similar to Ros1 KO mice (2). IS differ- 
entiation begins at 2 to 3 weeks of age and 
corresponds with spatiotemporal Nell2 ex- 
pression in testis. Postnatal differentiation 
of the IS was completely abolished in Nell2 KO
males, and this defect persisted throughout life (fig. S5).

Extracellular signal-regulated kinase 1/2 
(ERK1/2), a signal mediator whose phosphoryl- 
ation depends on testicular lumicrine factors 
(14), is up-regulated from 2 to 3 weeks post- 
natally in parallel with IS differentiation (5). 
Whereas the ERK1/2 signal is down-regulated 
in Ros1 mutant epididymis (15), it is hypo-
phosphorylated in Nell2 KO caput epididymis
(Fig. 1, F to H). Transcription factors Etv1,
Etv4, and Etv5, which are located downstream
of ERK signaling, are down-regulated in Nell2
KO IS (Fig. 1I). These results suggest that the

Fig. 2. NELL2 is required for ADAM3 maturation and subsequent sperm fertilizing ability. (A to
F) Migration of [(A) to (C)] green fluorescent protein-tagged WT and [(D) to (F)] Nell2 KO sperm ejaculated
into WT female reproductive tract. (G and H) Sperm-ZP binding assay with (G) WT and (H) Nell2
KO spermatozoa. Scale bars, 100 μm. (I) Immunoblot detection of cauda epididymal sperm lysates.
(J) Immunoblot detection of ADAM3 in TGC; TS (testicular spermatozoa); and caput, corpus, and cauda
epididymal spermatozoa. (K) Expression of genes encoding proteases in WT and Nell2 KO caput
epididymis. Only genes whose average read counts in WT are >100 are shown. (L) Immunoblot analysis
of caput epididymal protein lysates from WT, Nell2 KO, and Ros1 KO males.


molecular basis of impaired IS differentiation 
is common between Nell2 KO mice and Ros1 
KO mice. 

We next characterized the relationship be- 
tween impaired IS differentiation and male 
infertility. When spermatozoa ejaculated into 
female reproductive tract were visualized with 
an Acr-egfp transgene (16, 17), we observed 
Nell2 KO spermatozoa in the uterus but not 
in the oviduct (Fig. 2, A to F), indicating that 


the Nell2 KO spermatozoa are deficient in 
the passage through the uterotubal junction 
(UTJ), as observed in Ros1 KO mice (18). Further,
both KO mice showed lack of sperm-binding 
ability to zona pellucida (ZP), the oocyte extra- 
cellular matrix (Fig. 2, G and H, and fig. S6). 
The inability of spermatozoa to pass through 
the UTJ and to bind to the ZP is often caused 
by loss of mature A disintegrin and metal- 
lopeptidase 3 (ADAM3), a spermatozoa surface 


Immunology Frontier Research Center, Osaka University, Suita, Osaka 5650871, Japan.
Research Institute for Microbial Diseases, Osaka University, Suita, Osaka 5650871, Japan.
Graduate School of Pharmaceutical Sciences, Osaka University, Suita, Osaka 5650871, Japan.
Graduate School of Medicine, Osaka University, Suita, Osaka 5650871, Japan.
Faculty of Life and Environmental Sciences, University of Yamanashi, Kofu, Yamanashi 4008510, Japan.
The Institute of Medical Science, The University of Tokyo, Minato-ku, Tokyo 1088639, Japan.
Center for Drug Discovery and Department of Pathology and Immunology, Baylor College of Medicine, Houston, TX 77030, USA.
Department of CNS Research, Otsuka Pharmaceutical, Kawauchi-cho, Tokushima 771-0192, Japan.
Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX 77030, USA.
Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA.
Department of Pharmacology and Chemical Biology, Baylor College of Medicine, Houston, TX 77030, USA.

*Present address: Department of Bioscience and Genetics, National Cerebral and Cardiovascular Center, Suita, Osaka 5648565, Japan.
†Corresponding author. Email: mmatzuk@bcm.edu (M.M.M.); ikawa@biken.osaka-u.ac.jp (M.I.)


Kiyozumi et al., Science 368, 1132-1135 (2020) 5 June 2020 




Fig. 3. Epididymal secreted protease OVCH2 is
required for ADAM3 processing and male
fertility. (A and B) H&E staining of IS sections of
epididymis from (A) WT and (B) Ovch2 KO mice.
Scale bar, 100 μm. (C) Immunoblot detection
of cauda epididymal sperm lysates. (D) Immuno-
blot detection of ADAM3 in TGC, TS, caput,
corpus, and cauda epididymal spermatozoa.
(E) Litter size analysis of Ovch2 KO mice. Average
and SD are shown. (F to K) Migration of [(F) to
(H)] dsRed2-tagged WT and [(I) to (K)] Ovch2
KO spermatozoa ejaculated into WT female
reproductive tract. (L and M) Sperm-ZP binding
assay with (L) WT and (M) Ovch2 KO spermatozoa.
Scale bars, 100 μm.

transmembrane protein, in cauda epididymal
mature spermatozoa (19). ADAM3 is expressed
in TGCs as a precursor N-linked glycosylated 
protein of ~100 kD and becomes processed by 
an unknown mechanism to a mature form of 
~25 kD during sperm transit through the epi- 
didymis (20). We found that precursor ADAM3 
is not processed correctly to mature ADAM3 
in Nell2 KO cauda epididymal spermatozoa 
(Fig. 2, I and J). Expression of other fertility- 
related sperm proteins was not critically af- 
fected in Nell2 KO testis and spermatozoa (Fig. 
2I and fig. S3). Transgenic rescue of Nell2 KO 
males with testis-specific NELL2 expression 
restored sperm ADAM3 processing (fig. S4). 
These results indicate that Nell2 KO epidid- 
ymis is missing a key protease involved in 
proper ADAM3 processing.

We hypothesized that a luminal environ- 
ment suitable for the maturation of spermato- 
zoa, including ADAM3 processing, is provided 
by fully differentiated IS but is impaired in 
Nell2 KO epididymis. We examined the ex- 
pression of genes encoding proteases in caput 
epididymis by means of RNA-seq analysis (Fig. 
2K and fig. S7). With RNA-seq and immuno- 
blot analyses, we found that two proteases, 
OVCH2 (ovochymase 2) and ADAM28, were 
strongly expressed in WT caput epididymis 
(fig. S7) but highly down-regulated in Nell2 
KO caput epididymis at the transcript and pro-
tein levels (Fig. 2, K and L). As previously re-
ported with ADAM28 (21), recombinant OVCH2
showed protease activity in vitro (fig. S7). 

To determine whether epididymal secreted 
proteases are necessary for ADAM3 process- 
ing and male fertility, we generated Ovch2 
KO and Adam28 KO mice using CRISPR-Cas9 
(Fig. 3 and figs. S8 and S9). KO mice lacking the 
epididymis-specific protease OVCH2 partially 
reproduced the Nell2 KO phenotype; although
IS differentiation was not compromised in
Ovch2 KO mice (Fig. 3, A and B), ADAM3 pro-
cessing in the Ovch2 KO spermatozoa was
abnormal (Fig. 3, C and D), and Ovch2 KO
male mice were sterile (Fig. 3E) because of the
inability of the sperm to transit the UTJ (Fig. 3,
F to K) and bind to the ZP (Fig. 3, L and M). By
contrast, Adam28 KO males were found to 
be fertile (fig. S9). Consistent with our model 
that OVCH2 is downstream of NELL2, Ovch2 
was not only epididymis-specific (fig. S7), but
OVCH2 protein was also exquisitely localized to
the IS of the caput epididymis (Fig. 4A), most 
proximal to the testis. These results indicate 
that epididymal-specific and secreted pro- 
tease OVCH2 is required for sperm ADAM3 
processing and consequential sperm fertiliz- 
ing ability. 

This study indicates the presence of a 
lumicrine signal pathway in which a testis- 
secreted protein (NELL2) signals through a 
ROS1-pathway to regulate the epididymal IS 
maturation and subsequent secretion of an 
epididymis protease (OVCH2) that processes 
ADAM3 for sperm fertilizing ability (Fig. 4, B
to D). NELL2 is produced by developing germ 
cells in testis and transits through the luminal 
space to the IS of the epididymis to trigger its 
differentiation. The differentiated IS then 
secretes many proteins, including proteases 
(such as OVCH2) that act on sperm ADAM3 to 
make spermatozoa fully functional. Because 
various genes are down-regulated in Nell2 
KO caput epididymis (fig. S7), it is very likely 
that epididymal luminal factors other than 
proteases also regulate sperm maturation in a 
different way but downstream of the NELL2- 
ROS1 lumicrine system. Likewise, because 
OVCH2 is required for male fertility, OVCH2
may process other sperm and/or epididymis 
proteins required for sperm maturation. The 
lumicrine concept that we have demonstrated 
here in the male reproductive tract may be 
applied more broadly to other organs in which 
luminal flow occurs. 

Barcoded microbial system for high-resolution
object provenance 

Determining where an object has been is a fundamental challenge for human health, commerce, and food
safety. Location-specific microbes in principle offer a cheap and sensitive way to determine object provenance. 
We created a synthetic, scalable microbial spore system that identifies object provenance in under 1 hour at 
meter-scale resolution and near single-spore sensitivity and can be safely introduced into and recovered from 
the environment. This system solves the key challenges in object provenance: persistence in the environment, 
scalability, rapid and facile decoding, and biocontainment. Our system is compatible with SHERLOCK, a 
Cas13a RNA-guided nucleic acid detection assay, facilitating its implementation in a wide range of applications. 


Globalization of supply chains has sub-
stantially complicated the process of de- 
termining the origins of agricultural 
products and manufactured goods. De- 
termining the origins of these objects 
can be critical, for example, in cases of food- 
borne illness, but current labeling technolo- 
gies are prohibitively labor intensive and easy 
to subvert (1). Tools that label persons or
objects passing through a location of inter- 
est could also be useful to law enforcement 
as a complement to fingerprinting and video
surveillance (2). Microbial communities offer 
a potential alternative to standard labeling 
approaches. Any object gradually adopts the 
naturally occurring microbes present in its 
environment (3, 4), so it has been suggested
that the microbial composition of an object 
could be used to determine its provenance (5). 
Challenges with this approach include variabil- 
ity of resident microbial community abundance 
over time; similarities of microbial composition 
between different locations; and the requirement 
for extensive, expensive, and time-consuming 
mapping of natural environments. 

To circumvent these challenges, we propose 
the deliberate introduction and use of syn- 
thetic, nonviable microbial spores harboring 
barcodes that uniquely identify locations of 
interest (e.g., food production areas). These 
synthetic spores would offer a sensitive, in- 
expensive, and safe way to map object prov- 
enance provided that several important criteria
are met, including: (i) the microbes must be
compatible with growth at industrial scale;
(ii) the synthetic spores must be biocontained
and not viable in the wild to prevent adverse 
ecological effects; (iii) the synthetic spores 
must persist in the environment and reliably 
label objects that pass through it; and (iv) 
the encoding and decoding of information 
about object provenance must be rapid, sen- 
sitive, and specific. Similar barcoding ap- 
proaches have been explored previously to 
model pathogen transmission (6, 7) but did 
not explicitly address those challenges. Here, 
we report the barcoded microbial spores 
(BMS) system, a scalable, safe, and sensitive 
system that uses DNA-BMS mixtures to per- 
mit the determination of object provenance 
(Fig. 1A). 

The BMS system leverages the natural abil- 
ity of spores to persist for long periods in the 

Fig. 1. BMS can be specifically and sensitively detected. (A) Schematic of the BMS application and detection pipeline. (B) Heatmap of endpoint fluorescence
values from in vitro SHERLOCK reactions of all combinations of 22 barcodes and 22 crRNAs assessing specificity of each barcode-crRNA pair. (C) Detection limit of
B. subtilis and S. cerevisiae BMS by SHERLOCK (each of the eight biological replicates for each spore concentration are shown). Spore numbers are calculated on a per-
reaction basis. (D) Heatmap of endpoint fluorescence values from in vitro SHERLOCK reactions testing the specificity of four barcodes for group 1 crRNA and four
barcodes for group 2 crRNA as detected by either specific or group crRNA.

Fig. 2. Persistence, transferability, and maintenance of BMS. (A) BMS persisted
on sand, soil, carpet, and wood over 3 months on ~1-m² test surfaces in incubators.
The y-axis shows the B. subtilis BMS number relative to week 1 levels. Bars represent
standard deviations. Perturbations were simulated wind, rain, vacuuming, or
sweeping. (B) Photograph and schematic of a large-scale (~100-m²) sandpit.
B. subtilis BC-24 and BC-25 BMS and S. cerevisiae BC-49 and BC-50 BMS were
inoculated in the shaded gray region in the diagram. The yellow shadow represents
the area over which the top 5 cm of sand from a 1.5-m² area of the inoculation
region was redistributed after a large fan fell over. (C) SHERLOCK signal from the
four BMS from the inoculated region “a” in the large-scale experiment. Dashed line is
the threshold for positive calls. BC-19 is the negative control. (D) BMS persist at
collection point “a” (within the inoculated region) and do not spread to collection
points “d” or “e”. Heatmap depicts the number of BMS (out of four) detected
by SHERLOCK at each collection point over 13 weeks. (E) BMS persist on grass in an
outdoor environment for at least 5 months. The grass region was inoculated
with B. subtilis BC-14 and BC-15 BMS. Samples from the BMS-inoculated region and from 3.7,
7.3, and 30.5 m away from the inoculated region were tested by SHERLOCK using
crRNA 14 and 15 (each of the five biological replicates for each grass region are
shown). (F) BMS were transferred onto shoes by walking in the inoculated region “a”
in the sandpit and were detected by SHERLOCK. (G) Abundance of BC-25 BMS
on shoes after up to 240 min of walking on noninoculated outdoor areas. The y-axis
shows the BMS count based on the qPCR standard curve.

environment without growth (8). We designed nonredundant DNA barcodes and
integrated them into the genomes of Bacillus subtilis and Saccharomyces
cerevisiae spores, creating a set of BMS that can be used combinatorially to
provide a nearly infinite set of identification codes. The BMS can be
manufactured at scale using standard cloning and culturing techniques,
inoculated onto surfaces by spraying, and transferred to objects that come into
contact with the inoculated surface. To identify barcodes, BMS sampled from
objects can be lysed and decoded with a range of methods, including SHERLOCK, a
recombinase polymerase amplification (RPA) method coupled with a Cas13a-based
nucleic acid detection assay (9); quantitative polymerase chain reaction
(qPCR); and sequencing (Fig. 1A and fig. S1A).

The BMS are designed not to affect the native environment into which they are
applied. First, we used auxotrophic strains that require amino acid
supplementation for growth. Second, we made the cells germination deficient.
For B. subtilis spores, we deleted the genes encoding the germinant receptors
and the genes that encode the cell wall lytic enzymes required to degrade the
specialized spore cell wall. Incubation of >10” spores generated from this
mutant strain showed they were unable to form colonies or grow in rich medium
and remained stable and nongerminating at room temperature for >3 months
(fig. S2, A and C). For S. cerevisiae, we boiled spores for 30 min to heat-kill
vegetative cells and spores before application. Incubation of >10° boiled
spores on rich medium yielded no colonies (fig. S2, B and D to F). All
antibiotic resistance cassettes used to generate the BMS were removed by
site-specific recombination to prevent horizontal gene transfer of resistance
genes to other organisms in the environment. Finally, the inserted barcode does
not encode any gene and should not confer any fitness advantage if horizontally
transferred. We tracked the microbiome of soil samples and found that
inoculation with BMS had insignificant effects on the microbiome compared with
natural changes over time or in response to watering (fig. S3). We also note
that B. subtilis and S. cerevisiae are both commonly found in environmental and
food samples.

Fig. 3. Determining object provenance using BMS and field-deployable detection.
(A) Schematic of experiment design and field-deployable method to determine
previous locations of an object. Each region was inoculated with one, two, or four
nonredundant BMS. Green outlines indicate the object path through a subset of the
regions. SHERLOCK reactions were imaged using a mobile phone camera to
photograph the reaction plate through a filter and under portable blue light
illumination. (B and C) (Top) Path of an object overlaid on a reaction plate.
(Bottom) Photograph of SHERLOCK reaction plate overlaid with correct or incorrect
calls. The call for each region is denoted by color (yellow check: true positive, blue
cross: false negative, purple cross: false positive). (D) Statistics for SHERLOCK
provenance predictions of objects traversing regions inoculated with one, two, or four
BMS per region. The false-positive and false-negative rates for one and two BMS
per region were based on the criteria of one or more positive calls; the rates for four
specific BMS per region were based on the criteria of two or more positive calls.

Multiple BMS can be applied and then de- 
coded simultaneously. We designed a series of 
tandem DNA barcodes, each with a Hamming 
distance of >5, allowing >10° possible indi- 
vidual barcodes. To test the specificity of our 
barcode design in a field-deployable system, 
we constructed 22 barcodes and their match- 
ing CRISPR RNAs (crRNAs) and assayed all 
permutations in vitro using SHERLOCK. All 
22 crRNAs clearly distinguished the correct 
barcode target (Fig. 1B). To scale the system, 
we devised a rapid and facile method to screen 
a large number of barcodes and crRNAs in 
parallel to eliminate those with cross-reactivity 
or background. We validated and performed 
pooled n-1 barcode RPA reactions in vitro with
corresponding crRNA and water RPA con- 
trols testing 94 crRNA-barcode pairs, elim- 
inating 17 for high background and seven 

Fig. 4. Determining the provenance of produce using BMS.
(A) Eighteen plants were each inoculated
with a distinct B. subtilis BMS
once per week (four times
total). (B) Detection of BMS on
plants and soil after harvesting by
SHERLOCK with a group crRNA. The
y-axis shows endpoint fluorescence
values. Plants A to S were sprayed
with BMS; plant T was not sprayed.
(-), negative control without DNA
template; (+), positive control from
DNA. Dashed line is the threshold
for positive calls. (C) Leaves from
plants inoculated with different BMS
were mixed together, SHERLOCK was
used to confirm the presence of
the BMS, and then Sanger sequenc-
ing was used to identify the origin
of each leaf. (D) Leaves were
screened for the presence of BMS by
SHERLOCK using a group crRNA.

for cross-reactivity (fig. S1, B and C). To test
sensitivity and specificity in vivo, we inte- 
grated 57 barcodes into B. subtilis and 11 into 
S. cerevisiae. We developed an efficient spore 
lysis protocol using heat and sodium hydrox- 
ide (fig. S4), which allowed us to achieve near 
single-spore resolution for detection using 
SHERLOCK (Fig. 1C). In vivo and in vitro spe- 
cificity screenings of crRNA-barcode pairs 
gave similar results (fig. S1, C and D). In ad- 
dition, our barcodes are tandemly designed 
with a specific sequence and a shared group 
sequence (fig. S1A) to aid in high-throughput
detection settings where only a subset of sam- 
ples contains the BMS of interest. The group 
sequence is compatible with field-deployable 
detections and can be used to determine 
whether a BMS of interest is present before 
using a second assay to specifically identify the 
BMS (Fig. 1D). This two-step process solves the 
throughput limitations of field-deployable de- 
tection and lowers the costs of sequencing. 
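
The barcode design constraint described above (every pair of barcodes separated by a Hamming distance of >5) can be illustrated with a short check. This is a minimal sketch under stated assumptions, not the authors' design procedure: the greedy selection strategy, the function names, and the 12-nt example sequences are all hypothetical.

```python
from itertools import combinations

MIN_DISTANCE = 6  # "Hamming distance of >5" read as at least 6 mismatches


def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))


def greedy_select(candidates, min_distance=MIN_DISTANCE):
    """Keep each candidate only if it is far enough from all barcodes kept so far."""
    chosen = []
    for seq in candidates:
        if all(hamming(seq, kept) >= min_distance for kept in chosen):
            chosen.append(seq)
    return chosen


def min_pairwise_distance(barcodes):
    """Smallest Hamming distance over all pairs in a barcode set."""
    return min(hamming(a, b) for a, b in combinations(barcodes, 2))


# Hypothetical 12-nt candidates for illustration only.
candidates = [
    "AAAAAAAAAAAA",
    "CCCCCCAAAAAA",  # 6 mismatches from the first: kept
    "AAAAAAAAACCC",  # only 3 mismatches from the first: rejected
    "AAAAAACCCCCC",  # 6 and 12 mismatches from the kept barcodes: kept
]

selected = greedy_select(candidates)
print(selected, min_pairwise_distance(selected))
```

A large minimum distance means that several sequencing or synthesis errors are needed before one barcode can be mistaken for another, which is consistent with the low cross-reactivity reported for the screened crRNA-barcode pairs.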
The BMS system is robust and can function 
on different surfaces in simulated real-world 
environments. First, in ~1-m²-scale experiments
(fig. S5A and table S6), we used qPCR to de- 
tect and quantify BMS directly from surface 
samples or surface swabs. We found that BMS 
persisted on sand, soil, carpet, and wood sur- 
faces for at least 3 months with little to no 
loss over time (Fig. 2A and fig. S5, B and C).
Notably, multiple tested perturbations (e.g., 
simulated wind, rain, vacuuming, or sweep- 
ing; see fig. S5A) did not reduce our ability 
to detect BMS from the surface. Second, we 
constructed an ~100-m² indoor sandpit (Fig.
2B and fig. S6), inoculated one region with
BMS (Fig. 2C), and were able to readily detect 
the BMS for 3 months using SHERLOCK (Fig. 
2D and fig. S7). Perturbations did not cause
appreciable spreading to noninoculated areas
(Fig. 2D and fig. S7); even a catastrophic
disturbance, in which a fan fell into the sand,
spread the BMS only several meters (fig. S6).
In an outdoor environment, BMS inoculated 
on grass was still detectable after 5 months 
of exposure to natural weather, with minimal 
spreading outside of the inoculated region 
(Fig. 2E). This is consistent with low levels of 
re-aerosolization reported for other spore- 
forming Bacilli (10).

The BMS can be transferred onto objects 
that pass through test environments. In ~1-m²-
scale testing, BMS could be transferred onto
rubber or wooden objects simply by placing
them on the BMS-inoculated surface for sev-
eral seconds, yielding up to ~100 spores/μl of
reaction input (fig. S5D). At ~100-m² scale,
BMS were reliably transferred onto shoes 
worn in the inoculated sandpit (Fig. 2F and 
fig. S8). Furthermore, the BMS transferred 
onto shoes could still be detected even after 
walking on noninoculated surfaces for sev- 
eral hours, although BMS counts decreased 
by twofold with 2 hours of walking as quan- 
tified by qPCR (Fig. 2G and fig. S9). We were 
unable to detect spores on noninoculated 
surfaces after walking on them with shoes 
that had traveled through BMS-inoculated 
regions (fig. S10). We conclude that the BMS 
can persist in the environment without signif- 
icant spreading, are transferable onto objects 
that pass through the environment, are re- 
tained on these objects, and can be sensitively 
and specifically detected using SHERLOCK. 

The BMS system can be used to label spe- 
cific locations of interest to determine whether 
a person or object has passed through them. 
We divided different surfaces into grids; in- 
oculated each grid region with one, two, or four 
nonredundant BMS (Fig. 3A); and traversed 
them with different test objects (e.g., shoes). 
To mimic in-field deployment, we used a por- 
table light source, an acrylic filter, and a mob- 
ile phone camera to image the SHERLOCK 
readout (Fig. 3A and fig. S11A) and to de- 
termine object provenance (Fig. 3, B and C, 
and figs. S11 to S13). Provenance could be
determined in the field within ~1 hour from 
sample collection. To evaluate the sensitivity 
and specificity of our system for determining 
an object’s provenance, we considered dif- 
ferent criteria for classification with varying 
numbers of BMS per region. Object provenance 
could be determined with a 0.6% (1/154) false- 
positive rate and a 0% (0/62) false-negative rate 
if regions were inoculated with four BMS based 
on the criteria of two or more positive BMS 
calls. Inoculating with only one or two BMS 
per region still permitted object provenance 
to be determined, albeit at higher error
rates (Fig. 3D and fig. S14). Provenance could 
be determined on all four surface types tested 
(sand, soil, carpet, and wood) (fig. S13); further
validation will be needed to determine error 
rates in real-world environments. This exper- 
iment demonstrates that the BMS can be used 
to determine object provenance at meter-scale 
resolution, which would be extremely dif- 
ficult to achieve using natural microbiome 
signatures (11). 
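
The classification rule and error rates described above can be sketched as follows. This is an illustrative reading of the stated criteria rather than the authors' analysis code: the function names are hypothetical, and the prediction lists are constructed only to reproduce the reported tallies (1 false positive among 154 unvisited regions, 0 false negatives among 62 visited regions) for regions inoculated with four BMS.

```python
def region_visited(calls, min_positive):
    """Call a region 'visited' if enough of its BMS are detected.

    calls: one boolean SHERLOCK call per BMS inoculated in the region.
    min_positive: 2 for four BMS per region, 1 for one or two BMS per region.
    """
    return sum(calls) >= min_positive


def error_rates(predicted, actual):
    """Return (false-positive rate, false-negative rate) over all regions."""
    false_pos = sum(p and not a for p, a in zip(predicted, actual))
    false_neg = sum(a and not p for p, a in zip(predicted, actual))
    negatives = sum(not a for a in actual)
    positives = sum(actual)
    fpr = false_pos / negatives if negatives else 0.0
    fnr = false_neg / positives if positives else 0.0
    return fpr, fnr


# Two of four BMS detected meets the threshold; one of four does not.
print(region_visited([True, True, False, False], min_positive=2))
print(region_visited([True, False, False, False], min_positive=2))

# Lists built to match the reported tallies, not raw experimental data.
actual = [False] * 154 + [True] * 62
predicted = [True] + [False] * 153 + [True] * 62
fpr, fnr = error_rates(predicted, actual)
print(f"false-positive rate: {100 * fpr:.1f}%, false-negative rate: {100 * fnr:.1f}%")
```

Requiring two of four calls trades a small loss of sensitivity per barcode for robustness against a single spurious positive, which is one way to read the lower error rates reported for four BMS per region.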

The BMS system offers a flexible and com- 
prehensive approach to determining food 
provenance. Foodborne illness is a global 
health issue with an estimated 48 million 
cases each year in the United States alone 
(12). There is an urgent need for rapid meth-
ods for identifying the source of food con-
tamination; current approaches often take
weeks and are costly because of the com- 
plex modern market chain (13). Plants in- 
oculated with B. subtilis BMS allowed us to 
map laboratory-grown leafy plants back to 
the specific pot in which they were grown 
(Fig. 4, A and B, and fig. S15). BMS were in- 
oculated four times, beginning 1 week after 
the first set of leaves appeared, to match re- 
commended inoculation protocols for Bacillus 
thuringiensis (Bt) spores, a U.S. Food and 
Drug Administration-approved biocide that 
is widely used in agriculture (14). One week 
after the final BMS inoculation, a leaf and a 
soil sample from each pot were harvested 
and tested using SHERLOCK. All of the sam- 
ples were positively detected except for the 
two plants that had received variant group 
barcode sequences, demonstrating the spec- 
ificity of detection (Fig. 4B and fig. S15, B 
and C). Using Sanger sequencing, we then 
identified the pot in which each plant was 
grown for all 18 BMS (fig. S15D). The process 
from DNA extraction to sequence identifica- 
tion took <24 hours. This time frame could 
likely be shortened to hours with massively 
parallelized hybridization-based detection (15). 

Cross-association of BMS-inoculated plants 
does not compromise the determination of 
provenance. To simulate cross-association 
that could occur during food processing, we 
mixed leaves from plants that were each inoculated 
with a specific BMS. Unlike with 
the other surfaces that we inoculated, BMS from 
inoculated plants did not transfer as easily to 
objects that came into contact with the plants 
(fig. S5 versus fig. S16, A and B). Although 
there was detectable transfer between leaves, 
the amount of transfer still allowed Sanger 
sequencing to cleanly determine the origin of 
each leaf (Fig. 4, C to E, and fig. S16, C and D). 

Bt could be used to determine food prov- 
enance. We used Bt spores applied during 
farming as a surrogate to test whether BMS 
would persist through conditions of a real- 
world food supply chain. For plants of known 
Bt inoculation status, we correctly identified 
all Bt-positive and -negative plants (38 total 
plants) (fig. S17). Furthermore, we detected Bt 
on 10 of 24 store-bought produce items of a 
priori unknown Bt status (figs. S17B and S18). 
Unexpectedly, BMS and Bt spores remained 
detectable even after washing, boiling, fry- 
ing, and microwaving (fig. S19), highlighting 
the potential to determine provenance from 
cooked foods. These results show the poten- 
tial for using the BMS system to determine 
produce provenance. 

Our work shows how rationally engineered 
microbial spores can be manufactured in 
a high-throughput manner to provide a 
new solution to the object provenance prob- 
lem. We have shown that BMS (i) persist in 
the environment; (ii) do not spread out of 
the inoculation area; (iii) transfer from soil, 
sand, wood, and carpet to contacting ob- 
jects; and (iv) permit sensitive and rapid read- 
out using laboratory and field-deployable 
methods. The ability to rapidly label objects 
and determine their provenance in real-world 
environments has a broad range of applica- 
tions across agriculture, commerce, and fo- 
rensics (2). Preliminary data suggest that BMS 
could work across various environments, al- 
though extensive validation in a wider range 
of real-world conditions is needed. Future 
iterations of our BMS system could be en- 
gineered for limited propagation and ac- 
tively contained for use in highly trafficked 
areas. This system could also provide time- 
resolved information about location history, 
making it useful for an even wider range of 
applications. 
PLASTIC POLLUTION 

Seafloor microplastic hotspots controlled by deep-sea circulation 

Ian A. Kane, Michael A. Clare, Elda Miramontes, Roy Wogelius, James J. Rothwell, Pierre Garreau, Florian Pohl 


Although microplastics are known to pervade the global seafloor, the processes that control their 
dispersal and concentration in the deep sea remain largely unknown. Here, we show that thermohaline-driven 
currents, which build extensive seafloor sediment accumulations, can control the distribution 
of microplastics and create hotspots with the highest concentrations reported for any seafloor setting 
(190 pieces per 50 grams). Previous studies propose that microplastics are transported to the seafloor by 
vertical settling from surface accumulations; here, we demonstrate that the spatial distribution and ultimate 
fate of microplastics are strongly controlled by near-bed thermohaline currents (bottom currents). These 
currents are known to supply oxygen and nutrients to deep-sea benthos, suggesting that deep-sea 
biodiversity hotspots are also likely to be microplastic hotspots. 


Plastic pollution has been observed in 
nearly all environments on Earth (1) and 
across all of its oceans (2-4). The effects 
of plastic pollution on marine ecosystems 
and implications for human health are 
of growing concern, as more than 10 million 
tonnes of plastic enter the global ocean each 
year (4-6). Converging surface currents in 
oceanic gyres are responsible for the global dis- 
tribution of plastics on the ocean surface (2, 3). 
These gyres effectively concentrate positively 
buoyant plastics into the now-infamous “gar- 
bage patches” (2, 3). However, sea surface accu- 
mulations only account for ~1% of the estimated 
global marine plastic budget (3, 4, 7, 8). Most 
of the remaining 99% of plastic ends up in the 
deep sea (7-9) (Fig. 1A). A considerable pro- 
portion [estimated at 13.5% (8)] of the marine 
plastic budget occurs as microplastics: small 
(<1 mm) fragments and fibers (10, 11) that 
originate as manufactured particles (12, 13) or 
are derived from synthetic textiles (14) or the 
breakdown of larger plastic debris (15). It has 
been shown that larger plastic debris may be 
associated with dense down-canyon flows in the 
Mediterranean (6). The seafloor is a globally 
important sink for plastics; however, the phys- 
ical controls on the distribution of microplas- 
tics and the effectiveness of their sequestration 
once deposited at the seafloor remain unclear 
(7, 10, 17-23). Owing to their small size, micro- 
plastics can be ingested by organisms across 
all trophic levels, enabling transfer of harmful 
toxic substances (9, 10, 22). Therefore, determining 
where microplastics accumulate and their 
availability for incorporation into the food chain 
is fundamental to understanding threats to glob- 
ally important deep-seafloor ecosystems (24). 

Rather than corresponding to the extent 
of overlying surface garbage patches, micro- 
plastics on the deep seafloor are preferen- 
tially focused within distinct physiographic 
settings (7, 19). Submarine canyons and deep- 
ocean trenches, which are foci for episodic yet 
powerful gravity flows, appear to be micro- 
plastic hotspots (7, 20, 25-27) (Fig. 2A). This 
physiographic bias suggests that the transfer 
of microplastics to and across the deep 
seafloor is not solely explained by 
vertical settling from surface gyres. It is likely 
that the role of deep-sea currents in the dis- 
persal and concentration of microplastics 
is similar to that of surface currents (26, 28), 
yet a paucity of contextual data (e.g., bathy- 
metric, oceanographic, and sedimentologi- 
cal) hinders the linkage of physical transport 
processes to the distribution and ultimate 
fate of microplastics. Thermohaline currents 
acting on the seafloor are one of the most im- 
portant processes for the deep-sea transport 
of fine-grained particles and build some of 
the largest sediment accumulations on our 
planet [called contourite drifts (29)] (Fig. 1B), 
but their role in sequestering microplastics 
remains unknown. 

Here, we link microplastic pollution on the 
seafloor to bottom currents by integrating 
high-resolution geophysical data, sediment 
sampling, microplastics analysis, and numer- 
ical modeling. The Tyrrhenian Sea was selected 
as the study area because (i) the dimensions 
and grain size of its physiographic elements 
are broadly comparable to those of many 
global settings (29-32); (ii) its ocean circula- 
tion patterns and velocities are comparable 
to currents globally (31, 33); (iii) its plastic 
input volumes and locations are well constrained 
(34); and (iv) high-resolution seafloor 
and ocean circulation data afford the 
spatial and temporal context to investigate 
our key questions. We analyze data from the 
Tyrrhenian Sea, where ocean water circula- 
tion is driven by the East Corsican Current 
and its return branch (Figs. 1C and 2E), which 
reach local velocities of >0.4 m s⁻¹ near the 
surface and >0.2 m s⁻¹ near the seafloor (30). 
The strongest bottom currents generally occur 
between 600 and 900 m water depth, where 
they actively sculpt extensive muddy con- 
tourite drifts (Fig. 1D) [<10 km wide and up 
to hundreds of meters thick (37)]. The con- 
tinental shelf is indented by the Caprera slope 
channel system, which extends downslope to 
the Olbia basin (32) (Fig. 1C). Terrestrial sedi- 
ment is delivered to the shelf by high-gradient 
rivers passing through rural, urban, and in- 
dustrial catchments and accounts for ~80% 
of the marine plastic budget in the region, 
with the remainder from shipping and fishing 
activities (6-8, 34, 35, 36) (Fig. 1, A and C). In 
this study, we address three questions: How 
important are bottom currents for the disper- 
sal and accumulation of microplastics on the 
deep seafloor? How do variations in bottom 
current intensity control the spatial distribu- 
tion of microplastics at the seafloor? And how 
efficiently are microplastics sequestered after 
their emplacement at the seafloor? 

All seafloor samples were found to contain 
microplastics (Fig. 3B and table S1), as verified 
with optical microscopy and Fourier trans- 
form infrared (FTIR) spectroscopy (fig. S2). 
Microplastics presented in two forms, as fibers 
(70 to 100%) and as fragments (0 to 30%) 
(Fig. 2, B and C). Microplastic concentration in 
the Tyrrhenian Sea includes the highest values 
yet recorded from the deep seafloor (Fig. 2A): 
up to 182 fibers and nine fragments per 50 g 
of dried sediment (191 total pieces per 50 g in 
core 6, equivalent to ~1.9 million pieces per 
square meter) were recorded in the contourite 
drift at the base of the northeast Sardinian 
continental slope (Fig. 3, B and C). This con- 
centration exceeds the highest levels previ- 
ously recorded, including those from deep-sea 
trenches, and is more than double that doc- 
umented in submarine canyons (27, 37-39) 
(Fig. 2A). As contourite drifts occur on most of 
Earth’s continental margins (29) (Fig. 1B), the 
high concentrations recorded here strongly 
suggest that these drifts are globally impor- 
tant repositories for microplastics. 
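
As a quick arithmetic check on the peak sample described above, the following Python sketch recomputes the total count, the fiber share, and the equivalent count per kilogram of dried sediment. The fiber and fragment counts come from the text; the helper name `per_kilogram` is an assumption made for illustration. The per-square-meter figure quoted in the text is deliberately not reproduced, because it depends on sampling parameters that are not given in this section.

```python
SAMPLE_MASS_G = 50  # mass of dried sediment per sample, from the text


def per_kilogram(pieces_per_sample: float, sample_mass_g: float = SAMPLE_MASS_G) -> float:
    """Hypothetical helper: scale a per-sample count to pieces per kilogram."""
    return pieces_per_sample * 1000.0 / sample_mass_g


fibers = 182    # core 6, from the text
fragments = 9   # core 6, from the text

total = fibers + fragments
fiber_share = fibers / total

print(f"total pieces per {SAMPLE_MASS_G} g: {total}")
print(f"fiber share: {fiber_share:.1%}")
print(f"pieces per kg of dried sediment: {per_kilogram(total):.0f}")
```

The fiber share of this sample falls within the 70 to 100% range reported for fibers across all samples.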

In our study area there is no relationship 
between microplastic concentrations and dis- 
tance from terrestrial plastic sources (Fig. 2B). 
Samples from the continental shelf (38 fibers 
and 3 fragments; core 9) and upper slope 
(8 fibers and 1 fragment; core 11) have some of 
the lowest concentrations reported in the 
study area. Instead, we show that microplas- 
tics are focused within a water depth range of 
600 to 900 m, where bottom currents form 
seafloor gyres and have the greatest interac- 
tion with the seafloor (Fig. 2C). The influence 
of these currents and the complex topographic 
relief (Fig. 2E) result in spatial variations 
in the shear stress exerted on the seafloor, as 
determined from hydrodynamic modeling 
(Fig. 3B). These variations in shear stress 
explain the localized seafloor distribution of 
microplastic particles, which typically have 
lower densities than silt- and sand-forming 
minerals and therefore are more easily entrained 
(39), accounting for depleted levels 
of microplastic in certain physiographic do- 
mains and concentration in others (Figs. 2D 
and 3C). The lowest concentrations of micro- 
plastics are found in contour-parallel moats, 
which are foci for erosion and/or nondeposition 
(e.g., 28 fibers and 3 fragments per 50 g in 
core 16) (Fig. 3C). Higher concentrations occur 
on the adjacent mounded drift (e.g., 86 fibers 
and 1 fragment in core 8) and also in other 

Fig. 1. Plastic sources and ocean circulation 
affecting the Tyrrhenian Sea. (A) Location 
of study area in the Tyrrhenian Sea, annotated 
with published terrestrial (~80%) and maritime 
(fishing and shipping; ~20%) plastic sources 
(34). Terrestrial input sources shown as 
circles and diamonds. Vessel traffic and 
shipping lanes shown as dashed lines. Modeled 
(and hence inferred) plastic debris fluxes on 
the Mediterranean seafloor illustrated as colored 
shading. The period modeled was 2013-2017, 
assuming vertical settling from surface 
distributions using a two-dimensional Lagrangian 
model (34). Inferred values in the northern 
Tyrrhenian Sea are <7 g km⁻² day⁻¹, which are 
low compared with those of the wider 
Mediterranean Sea. (B) Global distribution 
of documented contourite depositional systems 
(29) shown in red. (C) Seafloor bathymetry 
of the northern Tyrrhenian Sea annotated with 
documented terrestrial plastic input sources 
(6), named physiographic features (32), and 
seafloor sediment samples analyzed in this 
study (colored according to physiographic 
domain). The regional pattern of thermohaline-driven 
currents near the seafloor is shown 
by white arrows. Along-shore drift on the 
Corsican and Sardinian continental shelves is 
shown by a yellow dashed arrow. Line X-X' 
shows the location of the multichannel seismic 
line in (D), which illustrates the depositional 
features that have developed as a result of 
bottom currents, including the formation of thick 
mounded drifts, and inhibited sediment accumulation 
in moats. TWT, two-way travel time. 


mounded drift accumulations (e.g., 88 fibers 
and 6 fragments in core 2) (Fig. 3, B and C). 
Although microplastic abundance is gen- 
erally higher where bottom currents occur, it 
appears that there is a threshold bed shear 
stress above which microplastics no longer 
become concentrated at the seafloor. Mod- 
eling of particle transport under the ranges 
of shear stresses determined from the hy- 
drodynamic model indicates that microplas- 
tics are likely to be remobilized and potentially 
transported as bedload at shear stresses in 
excess of 0.03 to 0.04 N m⁻² (Figs. 3A and 4). 
Areas of higher shear stress are observed on 
the shelf break, upper continental slope, and 
particularly in contourite moats, where the 
bottom current is most vigorous (Fig. 3B). We 
propose that such contour currents are effective 
agents for the transport of microplastics 
and that, while microplastics are generally 
flushed along-slope through contourite moats, 
they preferentially accumulate in the adjacent 
contourite drifts, forming anomalously con- 
centrated microplastic hotspots. This appears 
to be particularly true for mounded contourite 

Fig. 2. Global and local abundance of seafloor microplastics. (A) Comparison of microplastic fiber 
abundance in different deep-sea settings worldwide (see supplementary materials), demonstrating the high 
concentrations observed in the contourite drift deposits in the Tyrrhenian Sea. (B to D) Results from the 
Tyrrhenian Sea illustrating that (B) microplastic abundance does not decrease with distance from terrestrial 
input sources, (C) microplastics appear to be concentrated within a depth range of ~600 to 900 m, and 
(D) microplastic concentration is biased toward physiographic domains, particularly mounded drifts, and 
depleted within moats. (E) A three-dimensional rendering (perspective view looking toward the southwest) of 
the regional European Marine Observation and Data Network (EMODNet) bathymetry (grayscale) and 
autonomous underwater vehicle (AUV) bathymetry (colored) shows the relationship of sediment samples to 
the local seafloor currents (10x vertical exaggeration). Horizontal distance from bottom left core (red circle) to 
top left (green circle) is ~100 km. Compare to map view in Fig. 1C. Inset microscope photographs show 
representative examples of fibers and fragments extracted from the sediment. 

drifts, which are often sites of extremely high 
sediment accumulation relative to the rest of a 
continental slope (29). 
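
The threshold behavior described above can be sketched as a simple lookup on bed shear stress. This is an illustrative simplification, not the authors' particle-transport model: the function name `transport_regime` and the regime labels are assumptions, and the only values taken from the source are the 0.03 to 0.04 N m⁻² remobilization range in the text and the ~0.1 N m⁻² suspension level in the Fig. 4 caption.

```python
# Threshold values in N m^-2, taken from the text and the Fig. 4 caption.
CRITICAL_LOW = 0.03   # lower bound of the critical shear stress range
CRITICAL_HIGH = 0.04  # upper bound of the critical shear stress range
SUSPENSION = 0.1      # level above which suspension may occur


def transport_regime(bed_shear_stress: float) -> str:
    """Hypothetical classifier: map a bed shear stress (N m^-2)
    to a coarse microplastic transport regime."""
    if bed_shear_stress < 0:
        raise ValueError("bed shear stress must be non-negative")
    if bed_shear_stress < CRITICAL_LOW:
        return "deposition likely"
    if bed_shear_stress <= CRITICAL_HIGH:
        return "near critical threshold"
    if bed_shear_stress <= SUSPENSION:
        return "bedload remobilization likely"
    return "suspension possible"


for tau in (0.01, 0.035, 0.05, 0.2):
    print(f"{tau:.3f} N m^-2 -> {transport_regime(tau)}")
```

In practice the thresholds depend on particle size and density, as Fig. 4 indicates, so a single set of cutoffs is only a rough guide.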

Previous Lagrangian modeling of micro- 
plastic transport in the Mediterranean [based 
on a model used to describe global microplas- 
tic distributions (3)] suggests that waves and 
sea-surface currents ought to transport micro- 
plastics away from the Tyrrhenian Sea (34). 
Therefore, the bottom plastic flux (assuming 
vertical settling only) in this basin should be 
one of the lowest: 1.5 to 7 g km⁻² day⁻¹, compared 
with a regional maximum of 70 g km⁻² 
day⁻¹ elsewhere in the Mediterranean (Fig. 
1A). If that modeling is correct, microplastic 
abundances elsewhere in the Mediterranean 
may be even higher than the values we report 
here. We suggest, however, that both bottom 
currents and surface currents are important for 
the concentration of microplastics, yet near- 
bed bottom-current circulation is omitted 
from existing models (2, 3, 8, 34). Ocean cur- 
rents appear to be highly capable of diverting 
microplastics from shallow to deep water and 
may be responsible for entraining microplas- 
tics transported downslope via submarine 
channels linked to terrestrial sources (Fig. 5). 
In enclosed or semienclosed basins, such as 
the Tyrrhenian Sea and more widely the Med- 
iterranean Sea, circulating contour currents 
are likely to preferentially accumulate micro- 
plastics within contourite drifts. On open con- 
tinental slopes, contour currents may disperse 
rather than concentrate microplastics. In such 
settings, these currents may play a key role in 
their spatial segregation into hotspots. 

We have shown that the overall pattern of 
bottom currents controls the distribution of 
microplastics at the seafloor. Numerical mod- 
eling and direct measurements in the Tyrrhe- 
nian Sea reveal a pronounced seasonal variation 
in bottom current velocities, with bottom cur- 
rents being most intense in the winter (37). 
Modeling of microplastic transport shows that 
some of the near-bed bottom current shear 
stresses are close to, or in excess of, that re- 
quired to entrain both fibers and fragments 
(Figs. 3A and 4). Although low-intensity cur- 
rents in the summer may allow for the accu- 
mulation of microplastics at the seafloor, at 
some locations (contourite moats, in partic- 
ular), previously deposited microplastics may 
be reexhumed as shear stresses exceed the 
critical boundary for remobilization and sus- 
pension (Fig. 3A). More powerful but ephem- 
eral seafloor flows such as gravity currents 
that have been recorded in deep-sea subma- 
rine canyons can reach velocities two orders of 
magnitude greater than those of the bottom 
currents in the Tyrrhenian Sea (up to 20 m s⁻¹) 
and can last for several days (40-42). These 
powerful events will undoubtedly either flush accumulated 
microplastics farther downslope 
(43) or loft them for recirculation by 

Fig. 4. Relating microplastics to seafloor shear stress. Flow regime 
diagram to show that the critical shear stress required to move particles 
[i.e., above the dashed gray line (51)] falls between 0.03 and 0.04 N m⁻², 
and that suspension [i.e., within or above the gray shaded area (52, 53)] 
may occur at >0.1 N m⁻². Two densities of microplastics are assumed 
(nylon and polyethylene) on the basis of the results of FTIR analysis. 
Three particle sizes for microplastics are shown, to represent the general 
range observed through visual microscopic examination [0.1 mm (red 
symbols), 0.5 mm (blue), and 1 mm (green) width]. The upper and lower 
bound measured grain sizes (D90) for the host sediment are also shown 
with gray symbols. 
Fig. 5. Bottom currents control the deep-sea fate of microplastics. Schematic diagram illustrating the role of near-bed currents in the transfer, concentration, 
and storage of microplastics in the deep sea. Along-shelf currents disperse microplastics, powerful gravity flows effectively flush microplastics to the deep sea, while 
thermohaline-driven bottom currents segregate microplastics into localized hotspots of high concentration. The effectiveness of their long-term sequestration depends 
on the intensity of subsequent bottom current activity and rate of burial. 


thermohaline bottom currents (Fig. 5). Such 
gravity flows play a globally important role in 
the lateral transport of lightweight particulate 
matter such as organic carbon (41, 44); hence, 
it stands to reason that they should also be 
important for microplastic transport. Episodic 
flushing of submarine canyons (41, 42, 45) sug- 
gests that canyons may only be temporary 
microplastic storage sites (26). This is anal- 
ogous to rivers where flooding can flush high 
levels of microplastics downstream (46). 
Bottom currents are efficient conveyors of 
nutrients and oxygen, and consequently they 
dictate the location of important biodiversity 
hotspots (41, 47-49). Unfortunately, we show 
that the same seafloor currents can also trans- 
port and emplace microplastics. The highest 
concentrations of microplastics on the seafloor 
occur in contourite drifts formed by bottom 
currents, and their distribution is controlled 
by spatial variations in current intensity. How 
effectively microplastics are buried or become 
reexhumed (and hence become more available 
for trophic transfer) depends on temporal 
fluctuations in current intensity. Although 
there are ongoing efforts to reduce the release 
of plastics into the environment, our oceans 
will continue to be affected by the legacy of 
past waste mismanagement (4, 5, 8, 50). Sea- 
floor currents will play a crucial role in the 
future transfer and storage of microplastics 
in the deep ocean. 
By Guilherme Santos 
No place like home 


Our lives changed tracks the day our daughter was born. My wife and I felt settled in the United 
Kingdom—we'd even bought a house 6 months earlier—but the arrival of our daughter forced 
a moment of reflection. We wondered whether we should move back to our native Brazil. I 
remember looking down at our newborn baby and thinking about how different her life would 
be there. In our native country, we’d be closer to our family and culture. But in the United 
Kingdom, I had the funding and lab resources I needed for my costly and highly specialized 
research program. Would such a move sink my career? 


We had left Brazil 10 years earlier 
so that I could spend the last year 
of my Ph.D. program in France. 
Meanwhile, my wife started a Ph.D. 
in the United Kingdom. We main- 
tained a long-distance relationship 
for the first few years. But by the 
time our daughter was born, we'd 
settled down in the same location; 
I had a research assistant position 
and my wife had a postdoc. 

I was in an amazing scientific 
environment, working next to Nobel 
laureates. But I had been away 
from my extended family and my 
culture for too long. I missed açaí, 
acarajé, pão de queijo, samba, bossa 
nova, capoeira, Brazilian jujitsu, and 
sunny skies. I worried that I might 
never return to Brazil. 

My wife missed our homeland 
as well. As a mathematician, it was 
also easier for her to imagine doing 
her work in Brazil; unlike me, she didn’t need fancy lab 
equipment and expensive reagents. So with a newborn baby 
at home, we started to ask ourselves whether a move was 
in order. 

Several events over the next few months nudged us fur- 
ther in the same direction. We heard that new professor 
positions had opened up at a university in our hometown, 
in departments that matched both of our research pro- 
grams. Then, a few weeks later, I received a call in the 
wee hours of the morning from my brother, who told me 
that our father had passed away. The great happiness I 
had felt about my daughter’s birth suddenly gave way to 
a deep sadness. I felt that I was losing precious time with 
my loved ones. 

After that, we had no doubt it was the right time to re- 
turn to Brazil. We applied for the professorships and both 
received offers. It felt like our personal and professional 
lives were falling into place. 


“I had been away from 
my extended family and my 
culture for too long.” 


It wasn’t easy getting started with 
my research in Brazil, however. I 
didn’t receive any startup funding 
for my lab, so I couldn’t recruit grad 
students or hire postdocs. I was 
able to perform a few experiments 
myself thanks to reagents I had 
brought with me from my lab in the 
United Kingdom, as well as equip- 
ment I shared with other profes- 
sors. Eventually, I secured Brazilian 
funding, which allowed me to form 
a team of bright undergraduate and 
graduate students and buy reagents 
and lab equipment on my own. 

Nine years later, I still haven’t 
been able to perform the kind of ex- 
periments that U.K. funding would 
have allowed. And the antiscience 
views of the current Brazilian gov- 
ernment have added to my worries 
about my squeezed budget. 

But looking back, I’m happy that 
my wife and I put our personal well-being first and made 
the decision to move back to our home country. We’ve been 
able to have big meals with our extended family on week- 
ends. We had a second child, and our kids are growing up 
close to their cousins who are around the same age. And I 
was able to resume Brazilian jujitsu training, which I hadn’t 
been able to do during my years abroad. 

We are living the lives we wanted to live—working on 
rewarding research while staying true to what we feel is 
best for ourselves and for our family. Any scientist from a 
developing country who is considering a move home has 
to weigh the pros and cons, and the situation will be dif- 
ferent for everyone. But for us, returning home was worth 
the costs. 


Guilherme Santos is an associate professor at the University 
of Brasilia. Do you have an interesting career story to share? 
Send it to SciCareerEditor@aaas.org. 